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Prologue 



Quantum mechanics is weird. I had never realized this until the spring of 2003, when 
dr. Maassen clearly and carefully explained to me, in a two-hour lecture, why quantum 
mechanics cannot be a simple hidden variables theory. 

This was something of a shock to me. But as the shock subsided, I realized that 
weirdness and lack of objective determinism were the least of my problems. There is the 
horrible threat of inconsistency! 

Quantum mechanics is intrinsically probabilistic. Observation however involves a sin- 
gle outcome. In order to handle this consistently, quantum systems must exhibit a so- 
called col·lapse of the wave function. There are loads of specific theoretical models which 
show collapse on a measured system. But is this really necessary? Or would it be possible 
to perform a measurement without collapse? 

Let's kill the tension right away. The answers are yes and no respectively. If you bear 
with me for E01 short and exciting pagès, l'U prové to you that transfer of information 
out of a system always causes collapse on that system. Along the way, we will gain 
quantitative insight into the balance between information gain and state disturbance. 

Well then. Now that l've skillfully tricked you into reading the rest of this thesis, I am 
left with the pleasant task of thanking people. It goes without saying that I am grateful 
to my friends and family for such diverse matters as giving birth or money (which only 
goes for a fairly restricted class of family members) and tolerating or even supporting 
my (rather obnoxious) enthusiasm. (The latter goes for anyone having had even remote 
contact with me over the past year.) 

But here and now, I would like to express my gratitude to those who made a direct 
contribution to this thesis: Prof. Ronald Kleiss for agreeing to be my official first and 
unofficial second supervisor. Prof. Klaas Landsman for a careful reading of this text, and 
for many useful suggestions. Màdàlin Gutà for suggesting a simple proof of lemma l(ÏT|l in 
the case of completely positive maps, putting me on track for proposition pB]l, Janneke 
Blokland for useful advice on the editing. And most of all, I would like to thank dr. Hans 
Maassen. 
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The two-hour lecture I told you about was part of his course in 'Quantum Probability, 
Quantum Information Theory and Quantum Computing' which lies at the very heart of 
this thesis. Indeed, insiders will recognize chapter as a mere extension of the lecture 
notes |Ma,aj . I am thankful for the excel·lent guidance and for the countless conversations 
we had, shaping my perception of quantum mechanics into its present state. They were 
sometimes slightly confusing, but always pleasant and fertile. I have learnt much from 
Hans over the past year, and I would be proud if his style may be seen, reflected in my 
writing. 

Bas Janssens 
December 2004 
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Definitions and Conventions 



p 


Generically denotes a classical probability distribution. 


Ep (d) 


Ep(d) = a(w)P(dw), the expectation of random variable o 




under P. 


Var P (o) 


Varp(o) = Ep(o 2 ) - E P (a) 2 , the variance under P of random 




variable a. 


Cov P (a, b) 


Cov P (a, b) = Ep(ab) - E P (a)E P (b), the covariance under P of 




a and b. 




If V C C, then ^(V) is the space of continuous functions 




on V. 


sé, 3$,^, 9 


Script letters denote C*-algebras. 


a 


Automorphisms are usually denoted by a. By an automor- 




phism of a C*-algebra, we always mean a C*-automorphism, 




i.e. we assume a(A) J< = a(A^). 




If A e sé, then 'rf(A) C sí is the C*-sub-algebra generated 




by A and I. 


y{sé) 


Denotes the convex state space of normalized positive linear 




functionals sí — > C. 


~z 


The complex conjugate of a complex number zeC. 


Ai 


The Hermitean conjugate of A £ sí. 




SL4 = \{A + À*): the real part of A e sí. 




QA = ^(A — Aï): the imaginary part of A € sí. 


M* 


If M is a positive linear mapping 3§ — » sé , then its dual M* 




is a y{sí) -» y{3§) mapping defined by M*(p) = p o M. 


Var„(A) 


V&r p (A) = p(A^A) — p(A)p(A), the variance under p of 




A e sí. 


Cov p (A) 


Cov p (A, B) = p(A J< B) — p(A)p(B), the covariance under p of 




A,B e sé. 


Spec(,4) 


The spectrum of A £ sí . 


y 


Y' = { A e sí | [A, Y] = }, the relative commutant of Y. 
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The àlgebra of n x n-matrices acting on C' 



p+ = 
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Introduction 



In the prologue, I already made a brief sketch of the subject of this thesis. Allow me to 
add a few details. 

This is a Master's thesis in mathematical physics, written in the period September 
2003 - October 2004 at the Radboud University Nijmegen, under supervision of dr. Hans 
Maassen. Its aim is twofold: 

- First of all, I intend to prové general theorems, showing that state col·lapse on a 
measured system is a necessary consequence of transporting information out of that 
system. 

- Afterwards, we shall investigate the balance between information gain and state 
disturbance in a more quantitative way. 

The first point is in contrast with authors like Joos, Zeh and Zurek who, if I understand 
correctly, endeavour to find specific models of decoherence on a system, independent of 
the information transfer. 

Another group of authors (Hepp, Lieb, Sewell, Rieckers) transports information to a 
central pointer in an infinite system. Although centrality of the pointer enables them to 
model a global col·lapse on all observables, it inhibits them from using automorphic time 
evolution. 

I on the other hand will use finite (not necessarily finite dimensional) systems, trans- 
porting information to non-central pointers. I do not restrict attention to automorphic 
time evolution, but it is allowed as a special case in each proposition in this thesis. We 
will see that collapse of the wave function then automatically occurs on the examined 
system. But we will also show that an approximate collapse occurs on a much wider 
range of observables, including the observables of the measurement apparatus. 

This brings us to the similarities with this second group of authors. Most of chapter|2l 
is based on a most original idea, due to Hepp, that collapse has to do with the difference in 
size between the pointer (macroscopic) and the observable on which collapse is supposed 
to occur (microscopic). In fact, the whole point of using finite systems was originally just 
to get a quantitative estimate of how the idealization of infinite systems is reached in the 
realistic case of a large but finite system. Exactly how large must the system be? What 
observables defy collapse? You will find answers in chapter[2| 
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But before entering the bulk of this thesis, I would like to caution the reader about 
two points which might seem essential at first sight, but are in fact merely a matter of 
personal preference of the author: 

- In the postulates of quantum mechanics, systems are modelled by C*-algebras. This 
is not essential: I might just as well have chosen von Neumann algebras. If you 
are not familiar with operator àlgebra techniques altogether, you may take in mind 
38(34?), the àlgebra of all bounded operators on the Hilbert space Jí? . This example 
will serve you well throughout the text. 

- I also wield a rather unorthodox interpretation of quantum mechanics. I do this 
simply because it is in my eyes the simplest possible interpretation. Do not be 
distracted: the issue of interpreting quantum mechanics is quite separate from the 
issue of state collapse after information transfer. If you do not like my interpretation 
of quantum mechanics, simply take your own favourite interpretation and apply it 
to the mathematics in this thesis. The result will probably be satisfactory. 

Finally, a short note on source material. This thesis contains a grand total of 26 
lemmas and propositions, plus another 10 corollaries. Of course not all of these are new. 
There are three possibilities. 

Sometimes, I prové theorems already proven by others before. In that case of course, 
I refer to this person explicitly. I have also formulated a number of results which have 
been widely known for a long time. In that case I explicitly mention that it is a 'standard 
result'. This leaves a total of 19 lemmas and propositions plus 7 corollaries that are neither 
attributed to one particular person nor explicitly labelled 'standard result'. These are of 
my own invention. The reader will understand however that there exist no guarantees 
that no one else has invented them before. If so, I have not been able to track this down. 

Now, without more ado, we finally move from the disclaimer to the actual physics. 
Enjoy the ride. . . 
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Chapter 1 

Quantum Measurement 



In order to investigate quantum measurement, we dwell on the foundations of quantum 
mechanics for a short while. 

1.1 Postulates of Quantum Mechanics 

Regardless of their interpretation, we will postulate the existence of the three mathemat- 
ical protagonists of quantum theory: an àlgebra, a state and a one-parameter group of 
automorphisms. 

Postulate 1 A quantum mechanical system will be modelled mathematically by a unital 
C* -àlgebra sí ', the àlgebra of observables. 

Quite often, si = 3§{3l?), the àlgebra of all bounded linear operators on some Hilbert 
space . If you're not familiar with C*-algebras, this is a good example to keep in mind. 
In general, any C*-algebra si has a faithful representation on some Hilbert space 3f , see 
ISÉE1 p. 281]. 

Postulate 2 A physical state of this system will be modelled mathematically by a (nor- 
malized) positive linear functional p on si . 

The set of all possible states on si makes up state space, .y{si). A unit vector \if)) g 3V, 
for example, induces a state p on £§(3tf) by p(A) = (ip\A\ip). Respecting conventional 
abuse of language instead of common sense, we will not always distinguish between vector 
states and vectors. 

But these are not the only states allowed for the system. If for each positive integer i, 
we have a normalized vector \tpi) S Jif and a number pi € [0, 1] such that YlTLiPi = 1> 
we may form the state p(A) = Pí{Í ) í\M' i I ) í) on @ This is a positive linear 

functional on si as well, and thus perfectly acceptable as a physical state. This particular 
state happens to be normal, i.e. continuous in the weak operator topology. But we also 
accept non-normal functionals as states, according to the postulate above. 

Postulate 3 Time evolution in an isolated system is modelled mathematically by a one- 
parameter group of automorphisms of si : 1 1— ► at- That is, ott+ s = O-t ot s for all t, s £ R. 
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Let a* : 5?{sé) — > J?(sé) denote the dual action of a on state space: a* (p) = poa. Then 
a state p on time to will evolve to a state a*, tl _ t \p on time t\. 

A unitary U E sé induces an automorphism a of sé by a(^4) = WAU. Usually the 
one-parameter group of automorphisms describing time-evolution is induced by a one- 
parameter group of unitàries 1 1— > Ut G sé. So after a time í the observable A will evolve 
to at{A) = U^AUt- Translating to the Schrodinger picture, a state p will evolve to pt- 
Pt{A) = cttp(A) = p([ljAUt). If p is the vector state induced by then it evolves to 
Pt(A) — (ip\U~t AU t \ip) ■ In other words, p t is the vector state induced by U t \ijj)- 

Induced Probability Measures 

States on a C*-algebra have the pleasant property of inducing probability measures. This 
is clear from the following Standard result: 

Proposition 1 (Induced Probability Measure) Let sé be a C* -àlgebra. Let X G sé 
be Hermitean. Then each state p G y{sé) defines uniquely a probability measure V Pt x 
on the Borel a -àlgebra of Spec(X) such that p(f(X)) = J f(x)F Pt x{dx) for all f G 
^(Spec(X)), 

Proof : 

By the Gel'fand transform (see e.g. K&R, p. 271]), we have an injective C*- 
homomorphism / i— > f(X) of '^'(Spec(JíT)), the continuous functions on the 
spectrum of X, into sí '. We compose this with the state p : sé — > C to yield 
a functional E on Sf(Spec(X)). In short, E(/) = p{f(X)). E is positive: if 
/ > 0, then f(X) > in the operator ordering, hence E(/) = p(f(X)) > 
since p is a positive functional. 

By the Riesz representation theorem (see |Cohl p. 209]), E defines a unique 
Borel measure P Pi x on Spec(X) satisfying E(/) = / f{x)f Py x{dx). That this 
is a probability measure indeed can be seen from P Pi x(Spec(X)) = p(I) = 1. 

q.e.d. 

For example, consider the physically relevant case of a normal (i.e. weakly continuous) 
state p on a von Neumann àlgebra sé . Then X defines 1 a projection valued measure 
V h-> P(V). In this setting, P p , x is simply defined by P p ,x{V) = p(P(V)). In case 
p is a vector state and X has discrete spectrum with non-degenerate eigenspaces, 
X = J2i this amounts to F PjX ({xí}) = \ (ipi\ip}\ 2 . 



X V is a Borel subset of Spec(X). In this thesis, all subsets of spectra will be assumed Borel measurable. 
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1.2 Interpretation of the Postulates 



Up until now, we only have the existence of mathematical objects. In order to link 
mathematics to physical experiment, we seek an interpretation of the postulates above. 

1.2.1 An Inconsistent Interpretation 

Proposition cries out for an interpretation of postulates Q and {21 . The first that 
comes to mind would be: 

Interpretation 1 

- A quantum mechanical system will be represented by a unital C* -àlgebra sé ', the 
àlgebra of observables. 

- Each random variable a is represented by a Hermitean A € sé . 

- The random variable a objectively takes vàlues in Spec(A). If the system is in state 
p € ^{sé), then the probability that a takes value in V is F p ,a(V). 

This interpretation allows us to intèrpret the state p{A) = X^üi Pi tpi) as a 
system in state |^) with probability 2 p$. 

Unfortunately, the interpretation above is inconsistent, at least for sé = M2® M2. For 
each Borel subset V C Spec(A), P P: x(V) gives the probability that a takes value in V. 
Therefore a is a random variable on the probability space (Spec(A), F p> a(V)). Similarly, 
b is a random variable on the probability space (Spec(B), ¥ P) b(V)). 

But if a and b both take objective vàlues, then there must exist some probability 
distribution P on Spec(A) x Spec(_B) such that P(V x W) is the probability that a lies 
in V and b in W . So a and b must be random variables on the same probability space 
(Spec(A) x Spec(B),P), having P p ,a and f p ,B as marginal probability distributions. 

This means that each set of random variables has to satisfy Bell's inequalities. (See 
jJaul p. 116] for a very thorough and jEEU p. 673] for a very accessible version.) But in 
M2 <8> M2, Bell's inequalities are violated for certain choices of p. As a result, interpreta- 
tion is inconsistent. 

This is not exclusively the case for si = Mi ® Mi- Interpretation Q is inconsistent 
for sé = m(,?e) with dim(J?) > 2 (see |EES|). 

Induced Joint Probability Measures 

In order to pave the way for a consistent interpretation, we will extend proposition |T|I 
with the following Standard result: 



2 This interpretation of p is slightly less straightforward than it seems at first sight, since the decom- 
position of p into pure states may not be unique. 
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Proposition 2 (Induced Joint Probability Measures) Let X ,Y G sé be Hermitean 
such that [X, Y] = 0. Then each state p G -y(sé) defines uniquely a probability mea- 
sure P p ,x,y on the Borel a-algebra of Spec(X) x Spec(F) such that p(f(X)g(Yj) = 
J f{ x )9(yWp,x,Y(dx,dy). In particular: 

E,(X)=p{X), E r (Y)=p(Y) 

cov P (X, Y) = p{XY) - p{X)p{Y) 

Proof 

For continuous / and g on the spectra of X and Y, we have once again 
f(X) and g(Y) by the Gel'fand transform (see Kk\i p. 271]). We define 
a functional E on %?(Spec(X) x Spec(y)) = c Í(Spec(X)) ® ^(Spec(y)) 
by E(/ ® g) = p{f(X)g(Y)). E is positive: if / ® g > 0, cho ose f ,g > 
0. Then, because [f(X),g(Y)} = 0, f{X)g{Y) = /f{X)g{Y)^J{X) > 
in the operator ordering. Now since p is a positive functional, E(/ ® q) = 
p{JJ{X)g{Y)JJ(X))>Q. 

By the Riesz representation theorem (see |Cohl p. 209]), E defines a unique 

Borel measure P p .x,y on Spec(X) x Spec(F) satisfying E(/ <g> g) = 

J f(x)g{y)F Pi x,Y{dx, dy). Of course P p ,x,y(Spec(X) x Spec(yj) = p{I) = 1. 

q.e.d. 

For example, let p be a normal (i.e. weakly continuous) state on a von Neumann 
àlgebra sé. Then X and Y define commuting projection valued measures V i— > P(V) and 
W i-> Q(W). In this setting, P p x y is simply defined by the formula V p xy(V x W) = f 
pÇP(V)Q(W)). 

1.2.2 A Traditional Interpretation 

A Standard interpretation of postulates Q through Q is the following: 
Interpretation 2 

- A quantum mechanical system will be represented by a unital C* -àlgebra sé ', the 
àlgebra of observables 3 . 

- At any fixed time, there is one state p S S^[sé), representing all knowledge concern- 
ing sé . 

- Each observable is represented by a Hermitean element A of sé 4 . 

- There is an action called 'measurement ' . Observables only take on objective vàl- 
ues if they arc measured. Joint measurement of commuting observables A and B 
yields vàlues of A in V C Spec(A) and of B in W C Spec(_B) with probability 
P p ,a,b(^x W). 

3 In this case, we will denote both system and àlgebra by sé . 

4 Again, both the observable and the Hermitean element will be referred to by A. 
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- Time evolution on an undisturbed system sé is represented by a one-parameter group 
of automorphisms of sí : 1 1— ► at. A state p at time to will evolve to the state °$ti-t )P 
at time t\ . 

Bell's inequalities do not apply here because it is not possible to perform a simultaneous 
measurement on non-commuting observables. One problem solved. 

In order to intèrpret a state p on a system sé ', an outside observer is introduced, 
performing this abstract 'measurement of A\ This has the effect of forcing A to take on 
an objective value. Neither the observer, nor the measurement are described within the 
framework of quantum mechanics. But they do have a physically observable effect on the 
system. 

State Reduction 

We will demonstrate this with a simple example. Let sé be M%, the àlgebra of 2 x 2- 
matrices acting on = C 2 . This describes an electron, having only spin-properties. Let 
the observable Abe <r z £ M 2 , the spin in the z-direction 5 . a z has spectrum Spec(a z ) = 
{1,-1}. Suppose that a z is measured. According to interpretation <j2j, either a z — 1 or 
a z = —1. (With probabilities P PlíTz ({1}) and P PlC r z ({— 1}) respectively.) 

Suppose that the measurement is repeatable. This means that a second measurement 
of a z , performed immediately after the first, would yield the same result. Then after 
measurement, knowledge of the system has increased: if the measurement has revealed 
cr z = 1, then we know that any future measurement of a z will yield a z = 1 again. Accord- 
ing to interpretation JJ, we must now describe the system by a different mathematical 
state than before, one yielding a z — 1 with certainty. The only state on sé which does 
this is the vector state \ip+). This change of state forced by measurement is called state 
reduction. 

Classical State Reduction 

In classical probability theory measurement is also possible, and the reduction it produces 
is called 'conditioning'. A classical spin-system is described by a probability distribution P 
on a classical probability space fi = {+1,-1}. Repeatable measurement can be performed 
on the random variable o z : fi — > E defined by <t z (ui) = ui. If a z = 1, the observer will 
update P to the conditioned probability distribution P( • \ [a z = 1]) defined by 

FÍVIfc-lDtfï^J^-ÍKtV) (1.1) 

where 6+i is the point measure on ui = +1. Another observer, unaware of the measurement 
outcome, will describe the system by a distribution P' = J2x=± i^([ a z = x])V(u\[a z = x] ). 
This equals the original distribution P. So in a classical probability space, state reduction 
is subjective. It can be attributed entirely to the increase of knowledge of the first observer. 

5 For notation on spin systems, see nagelívl 
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Quantum State Reduction 

In quantum mechanics the situation is radically different: objective collapse after mea- 
surement can be verified experimentally. Suppose Mi is in vector state a\t/)+) + 0\tp-). 
Repeatable er 2 -measurement is then performed on M%. u z — \ will occur with probability 
|a| 2 and a z = — 1 with probability \(3\ 2 . So after measurement, the system is in state 
ip+ with probability \a\ 2 and in state ip- with probability \/3\ 2 . The situation is now 
objectively different from the one before: whether we know the value of a z or not, we may 
perform ovmeasurement. In both ip + and tp-, the probability of finding a x ~ 1 equals 
1/2. But before measurement, in the state a\ip+) + f3\ip-), this probability would have 
been 1/2 + &(«/?). 

So experimental verification of collapse can be achieved as follows: start with a system 
in state 1 / v / 2|V'+) + l/v2|V'-)< The first observer measures a z , the second a x . This is 
repeated a number of times. As soon as the second observer measures a x — — 1, the point 
is made. Collapse is verified objectively. 

In summary, repeatable measurement of a z always causes the state of M2 to jump: 

- If the observer learns that a z — +1, the state jumps from a\i/j+) +/?|í/>_) to ip+. We 
will call this change 'state reduction' 

- If the observer is ignorant of the outcome, the state jumps from the vector state 
a\tp + ) + /3\ip-) to the mixed state \a\ 2 \tp + ){tp + \ + \(3\ 2 \ip-)(ip-\. We will call this 
change 'state collapse' 

In the literature, each jump is commonly referred to as both collapse and reduction. In 
order to avoid confusion, we shall keep these notions separate. 

Comments 

There is a sharp and physically observable schism between the situation before and after 
measurement. Therefore, it is important to know if measurement takes place and if so, 
exactly when 6 . In practice, there is hardly any doubt as to when it takes place. And if 
you feel comfortable with interpretation (2J, you may read the rest of this thesis as an 
attempt to explain why, in practice, the exact point in time where the actual reduction 
takes place is not of much importance. 

But personally, I feel rather uncomfortable with the need for outside observers, not 
described within quantum theory, exerting influence on a system that is described by 
quantum theory. I would like my physical theory to be a universe in itself. It should 
describe all the observables that can be measured. But also all observers, and the act of 
measurement itself. 

1.2.3 My Favourite Interpretation 

First of all then, we want to describe all interference with a system sé within the frame- 
work of quantum mechanics. This does not mean that time evolution on sé is always 

6 J.Bell puts it like this (see IBell ): "... so long as the wave packet reduction is an essential component, 
and so long as we do not know exactly when and how it takes over from the Schrodinger equation, we do 
not have an exact and unambiguous formulation of our most fundamental physical theory." 
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automorphic. But it does mean that there is always a system ^ D sí such that time 
evolution is automorphic on Sl. Think of 12l as the entire universe, if you have to. 

Secondly, observables never take on objective vàlues at all. Physics is not about 
objective events. Physics intends to predict the observations made by observers. So a 
physical theory should have: 

- A list of all observers c € . 

- For each a list of observables that are directly observed by ^ . 

Then it should predict the probabilities of the observations made by each separate observer 
without having to make any reference to other observers or objective reality. 

Interpretation 3 

- There is one largest universal system. It is represented by a C*-algebra 2. 

- Each observable is represented by a Hermitean element A of 3>. Each observer is 
represented by an abelian C*-subalgebra ^ C & . directly observes all Hermitean 
A in^é '. cannot directly observe A if A^ c é'. 

- At any fixed time, there is one p £ y(3>) representing the physical state of 3>. Each 
direct observation of any A £ ^€ made by has a value in Spec(.A), the spectrum 
of A. If A, B £ <é ', then the probability that c € observes a value of A inV C Spec(A) 
and a value of B inW C Spec(i?) is given by V p ,a,b{V x W). 

- Even while observation takes place, time evolution is represented by a one-parameter 
group of automorphisms of S>: 1 1— > at. A state p at time to will evolve to the state 
a* ti _ to jp at time t\ . 

All A £ are observed by ^ and all probabilities of finding joint vàlues are given by the 
theory. This means that the observables in ^ may be considered random variables on 
some classical probability spacc. If A £ 2l and B £ *3) do not commute, then they cannot 
be directly observed by the same observer. Both are random variables, but not on the 
same probability space. Therefore Bell's inequalities do not apply. 

Each subsystem of 3) is of course represented by some subalgebra sé C Sl . If this 
subsystem happens to be invariant under the time evolution of 3), then we can regard sí 
as an isolated system with time evolution a.t\sï ■ 

Direct and Indirect Observation 

Notice that one single observer ^ cannot directly observe &\\ A £ 3> iï 3> is not abelian. 
Suppose for example that the observer is an eye. This eye observes directly the voltage 
on each of its neurons. Indirectly, it can also observe say a painting on the other side 
of the room: rays of light carry information from the painting to the retina and the eye 
observes voltages in the retina directly. There is a radical difference between direct and 
indirect observation. 

Direct observation is the most primitive form. It is needed to link mathematics to 
experience. It is restricted to observables A in the observer and it does not result in 
any objective col·lapse. 
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Indirect observation of observables outside is possible. However, this requires some 
pre-formed image of the outside world: the eye must trust photons to travel in straight 
lines. We will call this indirect observation measurement, and we will come to it later on. 



Reduction and Collapse 

Given a state p on sé , we now formally define its reduced state py on rf: 

Deflnition 1 (Reduced State) Let sé be a C*\-algebra. Let Y e sé , let p <G y(sé) 

and let p(Y 1f Y) ^ 0. Then we define 7 the state py on sé by 

py{a) = Tïrwy 

Suppose that a countable decomposition { Vi \ i G 1} of the spectrum of Y is given. 
This means that Vi are Borel subsets of Spec(y) such that Vi CiVj =0 for i ^ j, and 
\J ieI Vi = Spec(F). Then we also have a collapsed state C* p: 

Deflnition 2 (Collapsed State) Let sé be a von Neumann àlgebra. Let Y € sé be 
Hermitean, and let {Vi \ i £ 1} be a countable decomposition of its spectrum. Then if 
p € ,y(sé), its collapsed state C*{p) is defined by 

C*(p)(A)^ p(J2P(Vi)AP(Vi)) 
i 

where V ^ P(V) is the projection valued measure ofY. 

Note that C*(p)(A) = ^j^ > p,y(^)/ 9 P(^»)(^)- The collapsed state is the sum of reduced 
states, weighed over the probability distribution. 

Conditioning 

Suppose that p is a normal state on a von Neumann àlgebra sé . Suppose ^ directly 
observes both A and B so that [A, B] = 0. Then A has projection valued measure 

V i ► P(V) and B has commuting projection valued measure W i— > Q(W). One can then 
calculate the probability distribution of B £ ^ provided that ^ observes a value of A in 

V C Spec(A): 



? PiAiB ([BmW]\[AmV]) 



P Pi a,b([5íii^ and AmV]) 

P P ,A,B([AmV]) 
p(P(V)Q(W)) 

p(P(V)) 
p(P(V)Q(W)P(V)) 

p(P(V)) 
PP(V)(Q(W)) 
F p ^ v)tB ([BmW}). 



We will see that if p(yty) = 0, there will never be any need for a reduced state. Prom now on, when 
mentioning reduced states, I will tacitly assume their existence. Even in theorems. 
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The reduced state p-p(v) induces the conditional probability measure on any B <G A' , i.e. 
any B such that [-6,^4] = 0. We thus have an interpretation of pp{V) considered as a 
state on A 1 . 

A Benevolent Word of Caution to the Reader 

Since this is my thesis, I will proceed with my favourite interpretation. But keep in mind 
that this is merely a way of interpreting the mathematics to come. As such, theorems are 
universal and do not hinge on any interpretation. 
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1.3 Measurement 



We have stated that an observer ^ can observe IG® indirectly, even if X $ c é > . This is 
accomplished by transferring information from X to some Y G c tf, the so-called 'pointer 
observable', and then observing Y directly. If & is in state p, time evolution at must be 
such that P Q *( p ).y = P p ,x: the probability distribution that finds when observing Y at 
time t exactly equals the one any would find when observing X at time 0. 

Since we only need the automorphism at at the fixed time t when measurement is 
completed, we will drop the suffix t from now on. 

1.3.1 Automorphic Measurement 

If a is such that a(Y) = X, then P Q *( p ),y = P P ,x for all p G &{&)• It is immedi- 
ately clear that the averages are the same, a*(p)(Y) — p{aiY)) = p(X). But since a is 
an automorphism, a{f{A)) = f(a(A)) for all A G 9 and / G ^(Spec(A)). Therefore 
the expectation vàlues E used in the proof of proposition to construct the probabil- 
ity measures are automatically identical: a*p(f(Y)) = p{f{X)). Then the probability 
distributions themselves must be identical. 

Example 

Let S> — M 2 (8>M 2 . Think of M 2 ®I as an electron with spin. X — a z ®I will be measured. 
Think of 1 Mi as a small computer memory, capable of storing one bit of information. 
^ is the commutative àlgebra generated by Y = I ® a z : the actual memory. a is the 
automorphism defined by a(A ® B) = B ® A. Then a*{p){Y) = p{a{Y)) = p(X): the 
information that was in the electron prior to measurement has now arrived inside "té. 

Of course this is not repeatable: the states of the electron and the computer memory 
are interchanged, so that a second measurement of X will in general yield a different 
result. 

1.3.2 More General Measurement 

For automorphic measurement, we require that F a *(a),y — ^a,x holds for all a in 
But this may only be necessary for a restricted class of states a in S fi (^). An experiment 
often consists of two parts: a system srf to be examined in an unknown state p and a mea- 
surement apparatus 38 in a known default-state 8 r. sé contains the observable X that is 
to be measured. 33 contains some 'pointer-observable' Y , Automorphic time evolution on 
Q = sé ® 3è may now take place in such a way that Pa«(p®T),i®y = ^p<»t,X01 = Pp,x- 
Then Y is observed, directly or indirectly. The set of states a in y{3$) for which 
Pa»(o-),i®y = Pct,x®i must hold is in this case {a — p® t \ p £ SP{sé) }. 

We define M* : 5? {sí) -> ^{sd ® 38) by M*{p) = a*{p® t). Because it is affine, M* 
can be extended to a linear mapping £Í* — > {sé ® 3$)* on all continuous linear functionals 
on sé ' . It is therefore the dual of a linear map M : sé <£> 3$ — > sé. (Hence the notation 
M*.) Because M* respects normalization, M is unital: M(l) = I. And because M* maps 

8 This default-state r certainly need not be pure. 
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states to states, M is positive: B > => M(£>) > 0. M is even completely positive. (See 
chapter|3) 

In summary, an affine map M* : -> 5?{sé®8g) of the form M* (p) = a* (p <g> r) 

is by definition a perfect measurement iff Pm*( p ).y = ^p.x for all p € S"{s^). 

Example 

This example is basically due to Hepp (see jHep| ): let £^ = M2 (8 M2, M2 I is as an 
electron with spin X = a z <S> I. It is in an unknown state p £ 5?{M<}), I (8 M% is again 
a computer memory, capable of storing one bit of information. It is in the default-state 
r = ip+ . The actual memory c € is generated by Y — I <£> a z . 

In the notation of pageEl a is induced by the unitary operator P + ® I + P_ ® cr^ . It is 
the controlled not-gate. One easily checks that a(Y) equals a z ®a z and not X. However, 
for all p in S^[M-i) we have 

M*(p)(Y) =a*(p®r)(F) = p®ip + {a z ®a z ) = p®ip + {a z ®ï) = p{X). 

can now observe Y directly, finding the same probability distribution that direct ob- 
servation of X would have delivered. 

Unbiased Measurement 

We will also consider unbiased measurement: the average is transferred from X to Y, 
but not necessarily the entire probability distribution. Automorphic measurement is au- 
tomatically perfect. 

Now M*(p)(Y) = p(X) Vp E -YÇ-b/) M(Y) = X. This characterizes unbiased 
measurement. In contrast to automorphic measurement, this does not imply M(/(Y)J = 
f{X) for all continuous /. This means that in general Pm*(p),y 7^ F ft x, although the 
averages do coincide. For perfect measurement, equality holds: Pm*(p),y = ^p,x or 
equivalently M(/(Y)) = f(X) V/ G ^(Spec(X)). Yet even for perfect measurement, it 
may well be that a(Y) ^ X. In the example above, M* is an unbiased measurement of 
X: M(Y) = X. It is even perfect: M(/(Y)) = f(X) for functions on Spec(A). But it is 
not automorphic: a(l ® a z ) = a z ® a z , so a(Y) ^ X. 

In summary: all automorphic measurements are perfect. All perfect measurements 
are unbiased. And both statements cannot be reversed. 
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1.4 State Reduction 



Before plunging into the generalities of state reduction, let us look at an example. 
1.4.1 An Example of Reduction 

Let & = M 2 <g M 2 <g M 2 . Again, M 2 (g I (g I is an electron in unknown state p <E ^(M 2 ). 
It has spin X — a z <g I (g I to be measured. I (g M2 (g M2 is a computer memory in default 
state t = tp + (g ■ It is capable of storing two bits of information. ^ is the commutative 
àlgebra generated by Yi = I <g cr z (g I and Y 2 — 1 (g I <g cr z . 

First, in exactly the same way as above, X is measured and the information is stored 
on Y\\ at 1 is induced by the unitary operator P+ <g I <g I + P_ ® a x ®1. Then an- 
other, similar measurement of X is performed using Y 2 as pointer: a^-tx is induced 9 by 

P + <g I «) I + P_ (gltgC^. 

Finally, ^ directly observes Y\ and Y 2 in state M* (p) = a\ 2 (p®r). The results will 
be distributed according to the probability distribution Pm*(p),Yi,Y" 2 : 

M*0)(I<gP + (gP+) = p(P+) , = M*( ( o)(I(gP + <gP_) 

M*(p)(I(g P_ <g P+) = , p(P_) = M*(p)(I®P_®P_) 

In other words, c € observes: 

Yi = +1 and Y 2 = +1 with probability P p ,x({ + 1}) 
Fi = -1 and Y 2 = -1 with probability P PiX ({-1}) 
Yi = — 1 and Y 2 = +1 with probability 
Yi = +1 and Y2 = — 1 with probability 0. 

^ may intèrpret this correlation causally: the first measurement outcome influences 
the second. Correlation can also be seen with the help of the reduced state: 

{M*(p)) mF+& (A) = 

p (g r(a t2 ((I (g P+ (g I)A(I (g P+ (g I))) 

M*(p)(I(gP+(gI) 

p(gT(((P+(gP+ +P_ <gP_)(gl)a t2 (A)((P + (gP+ +P_ <gP_)(gl)) 

p (g r(P+ (g P+ (g I + P_ (g P_ (g I) 

p(gr(((P + (gl) ®l)a t2 (A)((P+ (gl)(gl)) 

= « t * 2 (pp+ ®r)(A) 
= M*( PP+ ) 

In the third step, we have made special use of t = ip + <g 1/'+ • According to the discussion 
following definitionQ] the above equation has the following significance: 

Observations made by c é ', conditioned on the first measurement outcome Y\ = +1, 
will be as if the electron had originally been in the reduced state pp + . 

9 In realistic models, at 1 and att 2 -t 1 belong to the same dynamical semi-group, so that at 1 °a t2 -t 1 = 
a t2 -t 1 ° oit 1 . This is satisfied in this example. 
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Since pp + = ip+ for all p in Sf{Mi), and because M*(í/>+) = ip+ ® this explains that 
if, according to c é' ) the first measurement yields +1, so does the second. 

Nota Bene 

One would be tempted to pose the following question: 

Suppose that observes Y\ = +1 at time t\. Does c € then necessarïly observe 
Yi = +1 at time t 2 ? 

This question is metaphysical in nature because it cannot be answered by experiment. At 
time Í2j how do you know what you observed at time íi? You must consult some memory. 

Any experiment one could possibly devise involves a memory (possibly external; a 
piece of paper or a hard-disk) storing information on Y\. Above, this memory is simply 
ii itself. And just as above, the result of observing this memory at time Í2 will never 
yield discrepàncies within the memory, independent of the observation made at time t\ . 

1.4.2 Reduction as a Consequence of Measurement 

Many examples of the kind above have been described ( |Hep| , |Bohl p. 292], |Bfc.TI p. 678]). 
But in fact, reduction is not just possible, as has been known for a long time (see |Neu| ). 
It is a necessary consequence of transferring information from X to some pointer Y. 

The Origin of Reduction and Collapse 

Let M : sí ® 28 — > sí be such that M*(p) = a{p ® r) for some automorphism a and 
t e S"(2§). Suppose M is a perfect measurement of X £ sí with 10 pointer Y e sí % 28. 
Suppose that sí and 28 are von Neumann algebras, as in the previous example. Then 
X and Y have projection valued measures V i— > P(V) and W i— > Q(M / ) respectively. If 
p is normal, then P M *( P ),y = P^x implies M*(p)(Q(V)) = p(P(V)) for all subsets V of 
Spec(A): the spectral projections of X are measured, using the corresponding ones of Y 
as pointers. In this situation we can apply: 

Proposition 3 (Reduction) Let sí and 2§ be C*-algebras, P € sí and Q € sé ® 28 

projections. For p € 5^ {sí'), let M*(p) = a*(p ® r) /or some automorphism u of sí ® 28 
and t £ ,Y{2§). Suppose M*{p){Q) = p(P) for all p € .9" {sí). Then for any p € ^(^); 

(M*(p)) Q = M*(p P ). 

Proof : 

By moving to the GNS-representation, we may assume p ® r to be a vector 
state |i/>). Now by assumption, 

a*(p P (8 r)(Q) = M*(p P )(Q) = p P (P) = 1. 
10 This includes automorphic measurement if ^ happens to be I and M automorphic. 
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Since (pp <S> t) — (p<g> r) P( g )I corresponds to the vector state hp^uvii i 
we have 

a {pp r)(Q) = — iipgiwp — = ( } 

a(Q) is a projection. Therefore equation lll.2|) implies 

|HQ)P®i|V->)ll 2 = \\p®i\í>)\\ 2 

which entails, again because a(Q) is a projection, that 

a(Q)P <g> 1#) = P® I|^>. (1.3) 
In a similar fashion, M*(/9i_ P )(Q) = /9i_p(P) = leads to 

a(Q)((I-P)®l)|V) =0. (1.4) 
Equations 11.31 and 11.41 imply 

a(Q)\xj;) =P®ÏÏ|V>. 

Thus for all D € ^ ® SS: 

(M(/?))qW - M*(p)(Q) 

(a(Q)yj\a(D)\a(Q)yj) 
P(P) 

((P(g)II)V>|a(£>)|(P(g)II)V>) 
p <g> r(P (g) I) 
= (p8T) P8 i(a(fl))=a*(/)p«T)(D) 
= M*(p P )(£>). 

g.e.d. 

State Reduction 

This has two major consequences. The first is subjective: 

Suppose that a perfect measurement of X £ & is performed with pointer 
Y G fé 3 C SI. If *2> was in state p £ S^(s^) before measurement, then all 
observations made by fé 7 after measurement, conditioned on the observation 
that the measurement outcome Y is in the set V , will be as if the system had 
originally been in the reduced state pp(y) instead of p. 

We now have an interpretation of the reduced state ppnr) outside Y' . Perhaps the fol- 
lowing commutative diagram says more than a thousand words: 
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p 



M*(p) 



conditioning 



reduction 



PP(V) 



(M*(pJ) Q(v) 
M*(p P(V) ) 



P M .0» ) ,y(·|[ymV]) 



P; 



The map p — > /Op(y) is called state reduction. The fact that we observe state reduc- 
tion (left hand side of the diagram) results from harmless conditioning on the physically 
relevant probability distributions (right hand side). 

State Collapse 

The second consequence is objective. It is summarized in the diagram below: 



Corollary 3.1 (Collapse) Let srf and SS be von Neumann algebras, X 6 .sé , Y 6 
sé ® 3$ Hermitean. Let { Vi | i G /} be any countable decomposition of Spec(X). Let 
M* : ,5^ (sé) — > SP(irf ® 28) be defined by M.*(p) = a*(p <g> r) for some automorphism a 
of sé ' ® SS and normal state r G 5^(33). Suppose M* is a perfect measurement of X with 
pointer Y , i.e. Pm*(p),y = ^p,x for all p G S^(sé). Then for all normal p G y(sé), and 
for all De Y': 



P 



M*(p) 



collapse 



on Y' 



C*(P) 



M* o C*(p) 




Proof : 



Let V i— ► P(V) and V i— > Q(V) be the spectral measures of X and Y respec- 
tively. Since (jj V = Spec(X), we have J2i Q(Vi) = 1 Suppose [D, Y] = 0. 
Then also [D, Q(V t )] = for all i G I. Therefore 



i 



= M*( P )(Y,Q(V l )DQ(V)) 



i 



= ^PM. W ,r(K)·(M*(p))q(v,)(í)) 



i 
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i 

= 53p(P(^)M(U)P(K)) 

7 

= (M*oC')(p)(fl), 

ç.e.d. 

Normally, a system ^ will be examined by an observer ^ C 2$ outside sé ' , This means 
that Y is of the form I ® Y. Then all A G ^ (8) ^ of the form À I will commute with 
the pointer. So regarded as a state on the examined system sé ® I, we have 

M*(p) = M* o C*(p). 

In other words: 

Suppose that a perfect measurement of X G sé is performed by an observer 
c £ C SS outside sé , using a pointer Y G I <£> SS. Then all measurements of any 
A G sé made by any second observer will be as if the system had originally 
been in the collapsed state C*(p) instead of p. 

For example, suppose that sé — M 2 is in vector state a\tjj+} + (l\ip-), and o z is 
measured perfectly by an outside observer. Then all subsequent measurement of sé will 
be as if sé had originally been in the mixed state |a| 2 • \ip+)(ip± \ + \(3\ 2 ■ \ip^)(ip-\- 

Summary 

Reduction is subjective. It involves only one observer. Reduction occurs after both direct 
and indirect observation. 

Collapse is objective. It involves at least two observers. Note the crucial role of 
[A, Y] = 0: if the first observer had been inside sé instead of outside, none of the above 
would hold. Collapse only occurs with indirect observation. Both are not just possible, 
but necessary consequences of measurement. 

1.4.3 Imperfect Reduction after Imperfect Measurement 

Suppose an unbiased measurement is not perfect, but still rather good. Then one does not 
expect a perfect reduction, but still a rather good one. Proposition Q allows such a gen- 
eralized version. In contrast to generalized collapse (which comes along quite naturally), 
generalized reduction is rather thorny and uncomfortable. But in the end, if we work hard 
enough, we do obtain a hard estimate of the reduction, even for biased measurement: 

Proposition 4 (Generalized Reduction) Let sé and Sè be C*-algebras, P G sé and 
Q € sé (gi SS projections. For p G jT'(sé), let M*(p) = f a*{p (g) r) for some automorphism 
a of sé ' ®SS andr G ^{S). Suppose there is a A > such that |M*(p)(Q) - p(P)| < A 
for all p G y(sé). Then for any p G ,5f(sé): 

||(M*(p)) Q - M>p)|| < M ^ (Q) (l + 2a/A + V /i + (1 + 2VA)^ 
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Proof : 



For notational convenience, define Po = P, Pi = I — P. Define ef = 
|p Pi (P)-M*(p Pi )(Q)| = |«y i)0 -M*(ppJ(Q)|- Hereef is the probability that 
a measurement of P in state pp i yields the wrong outcome. The depend 
on p, but are always less than y/A. By constructing the GNS representation 
of p ® t, we may assume p ® r to be a vector state l^)- Then the ej have 
geometrical significance: they regulate the length of the vector 

|Xi) = a(Q)Pi®I|V)-PPi®lW 

= (o(Q) - di.ojPj ® (1.5) 

by 

|| Xl || 2 d = f ||a(Q)P < ®I^)-í<,oP<®I|V)H 2 

= (P, <8 (o(Q) - 5 i>0 ) t (a(Q) - í i)0 )|Pi 

= (Pi ® I^|(l - 2í il0 )a(Q) + <5 il0 |Pi <8> IV») 

= (1 - 2£ i>0 )p $ t((P< ® I)o(Q)(P< ® I)) + íi,oP ® t(P» 1) 

= p(Pi)((l - 25 i)0 )a*(p<g> r Pi0 i)(Q) + St t0 ) 

= p(Fi)(\a*(p®T Pm )(Q,)-5 ifi \) 

= p(P i )(|M*(^p 4 )(Q)-í t ·,o|) 

= e?p(P<). (1.6) 
With this we will estimate 

M*(p)(QXQ) - M*( PP )(X) ■ M*(p)(Q) = (1.7) 
= a*(p® t)(QXQ) - a*(p P <g> t){X) ■ a*(p <g) r)(Q) 
= a*(p®T)(Q(X-a*(p P <g>T)(X))Q) 
= / 9®r(a(Q)a(X-a*(p P ®r)(X))a(Q)) 

= 51 ^ r ( Pfe ® MQMX - a* (p P <g> r)(JC))a(Q)Pj ® I) 

fe, i 

= ^(a(Q)P fe <g> IV»|a(JT - a*(p ® r P0 i)(X))|a(Q)P, <g> 1^) 
fe, i 

= ^(4,oPfe <8> + Xk\a(X ~a*{p® rp i)(X))|^, o P; <g Jh/> + Xl ). 
fe, i 

The last step goes by definition of \k- from equation jl.fip. we see that 
a(Q)Pfc (g) Ií/í = ófc,oPfc <E> 1^ + Xfe- We will examine the smallness of each 
term separately. 

= p ® r(P <g) Ia(X)P (g) 1) - p ® r(P (g) I)a* (p <g) rp®i)(X) 

= 0. (1.8) 
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That's one down. We will estimate the cross-terms with the Cauchy-Schwarz 
inequality. For that, we need the length of both ||xfe|| an d the vector 

\\a(X -a*(p®T Pm )(X))P®m\ 2 = 

a(X -a*{p® 7Pgi)pr))|P ® 1%I>) 
= {V ®l^\a{X^a{X) -2ïï(a(X)ïa*(p®T Pm )(X)) + 
+ \a*(p®T P<g , I )(X)\ 2 \P®IïP} 



= {P ®ma(Xya(X)\P - 

< ||P®I^|| 2 ||X|| 2 . (1.9) 
Now that we have the length of both vectors, we see by Cauchy-Schwarz: 

\\{Xk\a(X-a*(p®Tp & )(X))P<g>Ii>)\\ < ||X||·||P®I^||e fc ||P fc ®]ty>||. (1.10) 
And similarly 

\\(P®hl>\a{X-a*(j>®Tp & )(X))\xi)\\ < ||X||·||P®IV'||ei||P I ®I^||. (1.11) 
Finally, from \\a(X) - a*(p ® t p ^j(X)\\ < 2||X|| we see that 

\\{Xk\a(X -a*(p®Tp & )(X))\ X i)\\ < 2||X||e fc ||P fe ® IV>N|P/ ® h//\\. (1.12) 
Putting inequalities (jl.8l) , (II . 10f> , (|1,11|) and (11.1 2f) into ljl.7|) , we finally obtain 

||M*(p)(QXQ) - M*( PP )(X) • M*(p)(Q)|| < 

< 2||X||||Po®IV||(eo||Po®IVll+ei||Pi®^ll) + 

2||X||(e ||Po ®I^|| +ei||Pi®I^||) 2 
= 2||A:||(eo||Po®I^||+ei||Pi®I^||) x 

(||P ® ]ty>|| + e ||Po <8 I^|| + ei||Pi ® I^||). 

To estimate this last expression, note that Po and Pi are complementary 
projections, and is of norm one. Therefore, there exists an angle 9 such 
that cosí? = || I*o ® 1*011 an d sin# = ||Pi ®I^>||. Since both eo,ei < \/A, we 
have 

M*(p)(Q) - M (^)W M*(p)(Q) ^ (L13) 

with /(0) = 2 (cos 6» + sin6>)(cosé> + %/A(cosé» + sin0)). With Standard ana- 
lysis and goniometry, one can verify that / takes maximal value 1 + 2yA + 

1 + (1 + 2\/A) 2 , proving the proposition. 

q.e.d. 
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Proposition Q is contained in the above as the special case A = 0. Note that the 
bound disappears if the probability of observing measurement outcome +1 becomes less 
than l/2vA, This means that excel·lent measurement (A ~ 0) without reduction (upon 
finding +1) is not excluded, provided that the probability of outcome +1 remains small. 
Of course there is conservation of misery: the probability of outcome is large, and upon 
finding there is very good reduction. 

Nonetheless, this principle can be used nicely in so-called 'knowingly reversible mea- 
surement' (see jDArj l. This is a non-perfect measurement, leaving the state fixed with a 
certain probability. The observer obtains not only a measurement outcome, but also the 
information whether or not the state is conserved successfully. 
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1.5 State Collapse 



On nage ll51 we have obtained collapse from reduction in order to show the link between 
the two. But there is an easier way of proving the necessity of collapse, more suitable for 
generalization. 

1.5.1 Perfect Collapse after Perfect Measurement 

The setting is slightly different: an observer ^ C £3 attempts to decide whether a system 
sí is in state -01 or in In order to do that, a measurement M* : y(sí) — > y{sí <g> 3$) 
is performed (of the form M* (p) = a*{p®r)) in such a way that observation of a certain 
pointer-observable Y € yields with certainty y\ in state M* (ipi ) and y<i in state M* (fa). 

Lemma 5 Let \<pi) (i — 1, 2) be vector states on some àlgebra 3 ', Let Y S *2) be Hermitean 
such that 

Var^y) = and (&|y|&> = (i = 1,2) 
wïí/ï yi ^ y 2 - Tften /or all i £ f smc/í í/íízí [j4, Y] = 0; 

(<l· 1 \A\<h) = 0. 

Proof : 

|</>i) and | çí>2 ) must be eigenvectors of Y with eigenvalues y\ and 2/2 • Since 
[A, Y] = 0, A respeets the eigenspaces of Y. We therefore have \<fi\) _L \A(f>2): 

{vi-V2)(<h\A\<h) = (tfi0iH^a)-<0iHw0a) = A]|0 2 ) = 0. 

g.e.d. 

This Standard result can be used in the following way: 

Proposition 6 (Collapse) Let M* : &{sf) -> 5?(sí ®3$) be of the form M*(p) = 
a*(p®r) for some automorphism a of sí ® £$ and r <G 5? (88). Let tpi, (i = 1,2) be 
vector states on sí , let \a\ 2 + \f3\ 2 = 1 and let Y G sí <S> 8$ be Hermitean such that 

Var M , (V)i) (Y)=0 and M*(^)(Y) = w (i = 1,2) 

wií/i y í ^y 2 . Then for all A £ sí ® 8§ such that [A, Y] = 0: 

M*(|a^i + ^2>W> 1 +frh\)(A) = M*(|a| 2 |^)(^i| + |/3| 2 |^ 2 ) (M)(A). 

Proof: 

By the GNS-representation, we assume r to be a vector state |r). Thus 

IM^IaV'i+W^i+^D^-M^laHV'^^il + l/íl 2 !^)^!)^)! 
= |((cm/>i + fifo) ® T\a(A)\(aipi + /3ip 2 )®T) - 

(|a| 2 (V>i ® r\a(A)\fa ® r) + |/?| 2 (V>2 ® t|o!(A)|V>2 ® r)) | 

< 2\a\\(3\\{ipi ® T\a{A)\ip 2 ® t)\ 

< \{ip! ® T\a(A)\ip 2 ® t)\. (1.14) 
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The last step uses that 2\a\\(3\ < 1 since \a\ 2 + |/?| 2 = 1. 

Finally, we come to lemma J&J, here with the vectors \ipi ® t), (i — 1,2) and 
with the observable a(Y): 

iP i ®T(a(Y))=M*(iP i )(Y)=y i (1.15) 

and 

Vax^ r (a(Y)) = ® r(a(y) 2 ) - ^ <g> r(a(r)) 2 

= V^®r(a(y 2 ))-^®r(a(y)) 2 

= Var M , (V ,.)(F) 

= 0. (1.16) 
Thus (-01 ® r|a:(A)|<02 O r) = 0. 



q.e.d. 



If y is of the form I ® Y, then [A ® I, I <g> Y] =0 for any Aerf, This means that after 
the measurement is performed on srf in vector state \atpi + /3ip2), further measurement of 
any A S by other observers will be as if the state had originally been |a| 2 |-í/'i)('0i| + 

Of course the same holds for other observables commuting with Y, such as observables 
in ^ for example, or 11 in íé ® e é'. All of this has an immediate generalization for the case 
of a less perfect measurement, and for observables not quite commuting with Y . 



1.5.2 Imperfect Collapse after Imperfect Measurement 

Lemma 7 Let 4>i, (i = 1,2) be vector states on some àlgebra 3$. Let Y be a Hermitean 
element of srf such that <j>i(Y) ^ <f>2ÍY). Let 

ct>i(Y) = Vl and Var 0i (Y) = af {i = l, 2) 

be the expectation and variance of Y in the state 4n, (i = 1,2). If A is a Hermitean 
observable such that \\[A,Y]\\ < S\\A\\, then 

|(frl4fr)|< J + ^ + °V ll· 
m - 2/2 1 



n If the measuring device happens to be classical {3§ is commutative), then a complete and rigorous 
collapse has been achieve d. Thi s was proposed by Jauch (see |.Ta,nl p. 174]). Although this is an extremely 
useful remark (see e.g. [Hep ), I do not hold this to be a fundamental solution to the problem of 
measurement for the following reasons: 

- Measurement apparatuses consist of partides. Partides have momentum and position. These do 
not commute, so Abelian SS can only be an idealization. 

- Automorphic time evolution always conserves purity of states (see |Hep| lemma 2]). Even on 
Abelian algebras. 

In chapter|21 we will examine this idealization more closely. 
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Proof : 

For i = 1,2, 

( r t 2 = (^|Fty|^)_(^|yt| . )( 0.| F |^ ) . 

So 

<7? = <y&|i-p // |Y&) 

where P// denotes the projection onto the one-dimensional vector-space spanned 
by \4>). So I — V 1 1 is the projection orthogonal to \<j)i). We denote it by P_l. 
From P^_ = Pj_ = P_l we see that 

uí = llPx^ll. 

Decomposing Y\<j>i) into components parallel and perpendicular to \<pi) we find 

Y\<fn) = -P // Y\4 i )+P ± Y\<IH). 

Denoting P±Y\(/) i ) by \xí), bearing in mind ||xí|| = oc 

Y\<f>i) = (&|Y+|&>|&) + = Vi\<t>i) + |Xi). 

We use this in the following: 

(V2 - Vi){cl>i\A\(f> 2 ) = - (t/i0i|A|0 2 ) 

= (0i|Ar|^a) - (0i|A|x2) ~ (^i|YA|0 2 ) + (xi|A|^) 
= (^|[^F]|0 2 ) + (xi|A|0 2 > - (0i|A|x2>- 

Estimating with the Cauchy-Schwarz-inequality and the operator norm in each 
term we obtain 

\(y 2 -yi)\- 1(^1^)1 <(S + a 1+ a 2 )\\A\\. 

q.e.d. 

We use lemma Q in the same way as lemma {2J : 

Proposition 8 (Generalized Collapse) LetM* : S?{sf) -> ® ^) 6e oftheform 

M* (p) = a* (/9®r) /or some automorphism a of ® S§ and r £ ^{0). Let ipi, (i = 1,2) 
6e vector states on s>í ', let \a\ 2 + \/3\ 2 = 1 and letY £ ® 33 be Hermitean such that 

Var M . w (y)=(7 < and M*(^)(Y)=w (i = 1,2) 

imí/ï y í ^y 2 . Then for all A £ stf ® 2% such that [A, Y] < S\\A\\ : 

|M*(|aVi +Pih)(<x>k +PM){A) - M*(|a| 2 |^i)(^i| + |/3|> 2 )<V> 2 |)(A)| 

< ^r^ui 

\Vi ~ 2/2 1 
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Proof: 



As the proof of proposition ©, but we now estimate equation 11.141 with 
lemma Q instead of lemma © . 

q.e.d. 

The ratio í^~j is an indicator of the quality of measurement: suppose you know that, 
prior to measurement, srf is either in state ipi or ip2- To find out which, you perform 
measurement. Suppose y\ < 2/2, then you conclude that the state was -02 if the pointer 
Y takes value > Vl + y2 , The probability of deciding ip% while the state was really ipi is 

less than j^z^i by Chebyshev's inequality: Pm»(^ 1 ),k(|^ - Vi\ > lyi ~ y2{ ) < 

Of course the same goes for 1 «-> 2, so that , — — — r + t— — — , indicates the quality of 

b ' \yi-V2\ \y\-yz\ M J 

measurement indeed. 
Conclusions 

For the case of a pointer outside sé y I will summarize some consequences of measurement 
which will come in handy in the next chapter: 

- Collapse takes place on the commutant Y' of the pointer. This includes the original 
àlgebra ® I. 

- An imperfect collapse will also occur on observables A commuting well with the 
pointer Y in the sense that ||[j4, Y]|| < S\\A\\ for some small S. 

- If the measurement is imperfect ( a |i/'i)-measurement yields outcome ai only with 
high probability ), then also an imperfect collapse will occur on the commutant of 
the pointer. 
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Chapter 2 

Macroscopic Observables 



Suppose that an outside observer c € C 83 performs measurement on a system si. Then 
we have seen that a collapse must always take place on the original system si ® I. This 
simple and rigorous law of nature is, I believe, the collapse of the wave function usually 
alluded to in elementary textbooks on quantum mechanics (e.g. |B&.T| . |Dirj . |Boh| . |Nenj 
and even j.Taul p. 184]). 

But on the combined system si ® 83 there always remain observables with respect to 
which no collapse occurs. Indeed, Hepp and later Bell (see |Hep| and [Bel ) have pointed 
out that all one has to do to track these down is to run time evolution backwards. 

On M 2 , the observable a x may serve to distinguish mixtures from super- 

positions (p. EJ) • But if an outside observer c € C 83 performs a ip+/ip- measurement 
M* : S fi (M 2 ) -> -V(M 2 ® B) of the form M*{p) = a*{p®r), then 

M'j^a" 1 ^»!)) =p<g> T ((r x <g>I) = p(a x ) 

so by performing an a~ l {a x ® I)-measurement on M 2 ® 88, a second observer "té, outside 
Mi and 83, can indeed ascertain that a full collapse has not taken place 1 . In practice 
however, collapse is observed after measurement, even by the second observer. 

In my view (interpretationEl, collapse on closed systems simply does not occur. Ever. 
Which leaves me to answer: 

Question 1 Why are the remaining coherences so hard to observe in practice? 

Suppose one were to take the point of view that a rigorous collapse on closed systems 
does occur after measurement. (Interpretation[21) Then one needs to answer the following 
question: 

Question 2 Exactly when does collapse replace unitary time evolution on closed systems, 
and why is it so hard, in practice, to see the difference between collapse at one time rather 
than another? 

Which point of view to take is merely a matter of taste, not of importance. An answer 
to question entails an answer to the last part of question (2J : If it is hard to see the 

1 This in contrast to state reduction, which involves only one observer. See also n. lfíOI 
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difference between unitary time evolution and collapse, it is certainly hard to see when 
the former goes into the latter. 

To question I see three answers. Two of them testify to the weirdness of the 
observables on which no collapse occurs. Or rather, to the occurrence of collapse on 
classes of ordinary observables: 

2.1 Collapse on the Measurement Apparatus 

First of all, we have seen how collapse comes about on sf®l: one simply applies lemma 
to 0i = M*Oi) and 4> 2 = M*(^ 2 )- 

Collapse on the Original System 

The measurement is perfect if, starting with with ïpi, (í = 1,2), the pointer position 
after measurement is always y% = M* (ipi)(Y): then the corresponding variances of of the 
pointer observable Y — 1<S>Y equal 0. This results in a perfect collapse on s/ ®\ C (Ï<S>Y)': 
proposition 

Suppose the measurement is flawed, i.e. the input ipi, (i — 1,2) does not absolutely 
guarantee the pointer output j/j . Then it may still be possible to draw reliable conclusions 
from the pointer about the examined system, provided that 0$ -C \yi — y 2 \ íot i — 1, 2, or 
briefly , ai+<T2 , < 1. 

We no longer have any reason to expect the 'clean' collapse discussed above, but still 
an imperfect measurement must surely induce some imperfect collapse on sé ® I C Y' . 
This is proposition ©. 

Collapse on the Measurement Apparatus 

But a wider range of collapse can be obtained with the same ease: assume for example 
that a measurement M* distinguishes two eigenstates ip xi and ip X3 of some X G sí in a 
repeatable fashion. This means that another measurement of X ® I (perhaps by another 
observer) in state M.*(tp Xi ), (i — 1,2) will once again yield Xi with certainty: 

M*(ip Xi )(X®ï) =Xi and Var M , (jKi) {X ® l) = (i = 1,2). 

Then we can apply proposition © to the measured observable X®1 instead of the pointer 
1®Y , Collapse then occurs not only on (I(8>F)', but also on (X , which includes the 
àlgebra of the measurement apparatus, 

In exactly the same manner as above, an approximate state collapse on {X®T)' D \®S$ 
follows from proposition JHJ, provided that 

Var M . (v , xi) (X(8)I) + Var M *(^ 2 )(^ «) I) < \x x ~x 2 \ 2 . 

So the thoroughness of collapse on (X<g>ï)' is not regulated by the quality of measurement, 
but by how well M*(^i), (i = 1,2) remain eigenstates of X. Already, we have a first 
answer to question 

Answer 1 In a repeatable measurement, the remaining coherence can neither 
be detected on the original system, nor on the measurement apparatus alone. 
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2.2 Collapse on Local and Global Observables 



We have seen that there are observables on which no collapse occurs, but if the measure- 
ment is repeatable they lie neither in the original system íé ® I nor in the measurement 
apparatus I (8 8$. Moreover, the vector space (Not the àlgebra!) spanned by the commu- 
tant of the pointer Y and that of the measured observable X, denoted X' + Y' , allows no 
coherences to be detected. This already testifies to the weirdness of the observables we 
are looking for: I for one would be very interested to learn about actual measurements 
performed (with the help of a second measurement apparatus) on observables in ® S§, 
but outside X' + Y' . If they do exist, they are certainly quite exòtic. 

But there are two more classes of ordinary observables on which collapse takes place: 
the local ones and the macroscopic ones. This, I believe to be the main point of Klaus 
Hepp's 1972 article 2 'Quantum Theory of Measurement and Macroscopic Observables' 

TO - 

2.2.1 K. Hepp: Quasilocal Algebras 

Hepp investigates the possibility of modelling time evolution by a weak limit of auto- 
morphisms and, as the title suggests, pointers by so called 'macroscopic observables' in a 
quasilocal àlgebra. 

Let me try to suppress my sense of guilt about not explaining these notions properly 
by giving an example: imagine a countably infinite chain of quantum spins Mi indexed 
by n € N, their position on the real line. Local observables are supposed to affect only 
a finite amount of spins. For example, the spin in the z-direction of atom number i, 
a\ = I <8> . . . <g> I <g> a z <g> I <g> I ® . . . , is a local observable. So is St=i "'zi the average 
z-spin over the first N àtoms. Now a quasilocal observable is almost local in the sense 
that, outside a finite amount of sites, it is arbitrarily close to I in norm. 

Macroscopic observables however are not supposed to lie in the quasilocal àlgebra. We 
would like them to be something like 'averages': let Y n be a uniformly bounded sequence 
of local observables 'converging to infinity' in the sense that Y n has to do with spins 
arbitrarily far away from the origin for n sufnciently large. Then it would be pleasant 
to call limjv^oo jf Yli=o a macroscopic observable. For example, take Y n = er™. Then 
S z = limjv->oo jf J2iLo a l 1S the average spin in the z-direction. 

Unfortunately, this limit does not exist. At least not in norm. But if we choose one 
particular state on the àlgebra, we may form its GNS-representation. (See jKfcRl p. 278].) 
Then we have at our disposal a weak topology, coarser than the norm topology, in which 
the limit may well exist. 

In short, macroscopic observables lie in the weak closure of some represented quasilocal 
àlgebra, but not in the àlgebra itself. The crux of Hepp's article is macroscopic difference: 



2 The author himself did not, for as far as I can teli, seek to make this particular point. I take the liberty 
of interpreting his results in a different fashion, utilizing Hepp's considerable mathematical achievements 
in a context slightly different from the one originally intended. The following digression should not be 
seen as a summary of Hep , but as a highly personal interpretation. 
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Definition 3 (Macroscopic Difference) Let u>i and u>2 be states on a quasüocal àlge- 
bra @ ', Then u>i and u>2 are called macroscopically different if there exists a uniformly 
bounded sequence of observables Y n converging to infinity, and real numbers y\ ^ y 2 such 
that 

n=N 

J lm K Y,^ Y n)=Vi (* =1,2). 
N—>oo iV * — ' 
n=l 

and the upshot is formed by the following two lemmas: 

Lemma 9 (Lemma 6 of |Hep|) Letoúi anduj-i be macroscopically different states on a 
quasüocal àlgebra & having short range correlations. Then u>i and cj 2 are disjoint. 

Lemma 10 (Lemma 3 of |Hep|) Consider two disjoint states u>i anduj-i on a quasüo- 
cal àlgebra S>, and two (not necessarïly disjoint) sequences uii t t and u>2,t such that 
limt_>oo <^i,t = Ui (i — 1,2). Let (7r t ,J^ t ) be representations of S> and ipi,t,ip2,t G J%r t 
such that Lüi^t(A) = {i/ii,t \'Kt{A)\ipi t t) (i = 1,2) for all A S @. Then for all quasüocal 
D 

lim (^i lt |7r t (D)|^,t) = 0. 

These lemmas may be used as follows: we are attempting to measure, say, the observable 
a z in the àlgebra M 2 . We do this by coupling M 2 to a quasüocal àlgebra 88 in a state 
t. We now seek automorphisms a t of & — M 2 ® 88 such that lim^oo al (ipi r) = lví 
(i = 1,2), where w± and w 2 are short-range correlated, macroscopically different states on 
the quasüocal àlgebra 3. (Hepp gives several explicit examples of such constructions.) 

We may now use lemmas © and itTTUl consecutively to see that for each fixed quasüocal 
A G M 2 ® 88, all the 'cross-terms' go to zero: 

lim (V>i,íkí(-D)lV>2,t> = 

t — >oo 

in the sense of lemma 1KH . with u^t = a\ (ipi (g> r) for i = 1, 2. 
The relevance of this all to question Q is clear: 

Pointers used in real life are often macroscopic. 

Furthermore, macroscopic information is more easily detected than microscopic informa- 
tion. This is the very reason for using macroscopic pointers. A ray of light shining onto 
a measurement apparatus is almost certain to record the (macroscopic) position of the 
pointer, i.e. the average position of some 10 23 àtoms. It may even accidentally record the 
position of one single atom. But it is very unlikely to record the detailed excitation of 
each of these àtoms from their respective equilibrium positions. 

In other words: Hepp points out the classes of quasüocal and macroscopic observables 
as ordinary ones. In his spirit, an answer to question could be: 

'In the course of measurement with a macroscopic pointer, reduction occurs 
increasingly well on all quasüocal observables' 

However, it is not easy to pinpoint the exact physical relevance of Hepp's weak-operator 
limit procedure: 
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- First of all, the notion of a 'Macroscopic' observable is only mathematically defined 
on a quasilocal àlgebra. In reality, the àlgebra describing an actual measurement 
apparatus is usually extremely large, but not quasilocal. In what way, if at all, are 
Hepp's result approximately vàlid? 

- Secondly, we only come 'close' to macroscopically disjoint states in the weak topol- 
ogy. At each fixed time í, at is still automorphic. So there remain quasilocal 
observables A for which the cross-terms are large. On the other hand, for each fixed 
quasilocal observable A, the cross-terms do become small in the course of time. 
Putting it more precisely and less clearly: 

Ve>0 MAestf®,^ 3teR: t' > t => \(ip +ít \n{A)\t/j-, t )\ < e , 

yet 

3e>0 VíeR 3Ae£f®3§ 3t'>t: \(ip+, t \M A M-,t)\ > e ■ 
2.2.2 Local Algebras 

We'll give up quasilocal algebras all together, and with it the sharp distinction between 
local and macroscopic observables. Then we will give estimates on the amount of state 
collapse on 'local' observables, yet to be defined, based on: 

- Exactly how local the observable is. 

- Exactly how macroscopic the pointer is. 

- Exactly how much macroscopic difference there is. 

Instead of utilizing Hepp's machinery, we shall resort to lemma (0- Let us describe our 
combined system @ by a large but finite number N of possibly different àtoms, each 
described by an àlgebra @ç. Çiï — <2> i=1 Such an àlgebra, plus its (non-unique) 
subdivision into àtoms 3 may be called a local àlgebra. 

Let X i £ %. We will denote by the observable li <g> ... <g>A l ® ... ®l N 

in @. Although no sharp distinction can be made between macroscopic and microscopic 
observables, it is intuitively clear that for each set { X 1 \ i £ {1, . . . , N}, X z S S>i }: 

- jj YliLi is an average, or very global observable. It might well be observed by 
accident. 

- A37 is a very local observable, representing detailed information about one particular 
atom. (Number 37.) A measurement of A37 would probably cost a lot of effort, and 
is unlikely to be performed by accident. 

- X 1 (g) X 2 ® . . . (g) i"" 1 ® X N represents an observable giving detailed information 
about all àtoms in the measurement apparatus. It is unlikely that such a measure- 
ment can ever be performed at all, let alone accidentally. 

3 By 'atom', I just mean some small part of the àlgebra. It may represent an electron, atom or molecule, 

or any other structure small compared to @. The same àlgebra @ = (^f^j 10 (@r®@o®&ïi), describing a 
mole of wàter, must be considered a different local àlgebra according to whether one chooses the hydrogen 
and oxygen àtoms 3>yi and 3iç> as local àtoms, or the wàter molècules 3>n ® S#ç> tS 1 Shi- 
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So we would like to show that, if the pointer is Y = j? X)i=i some ver Y global 
observable, then observables with respect to which no collapse takes place are certainly 
not very global, nor very local, and typically of the third variety mentioned above. We 
will start by quantifying these rather vague notions: 

- Definition 4 (n-local) A Hermitean X is called n-local iff there exist integers 
1 < ix < . . . i n < N such that X G % x ® . ■ ■ ® % n C 9. 

An observable X G *2> is 'n-local' if it only affects n àtoms. A37 is 1-local. Of course, 
each 71- local observable is also n-local if n > n. 

- Definition 5 (/t-global) A Hermitean Y G is called n-global iff there exist 
M G N, 1 < i\ < . . . < %m < N and Hermitean Y ik € % k such that: 

• k > jj[ for all k. 

For example, J2i a l as on page[23is 7V _1 -global. X 37 is || A 3 7|j-grobal, for exam- 
ple because it equals A37 = j(X^i), or, if you happen to be in a troublesome mood, 
because A37 = -^ (0 + . . . + + NX37 + . . . + 0) . An observable will be called global 
if it is K-global for some k G R. Not all observables are global. 



The amount of difference between two states 4>i and 4>2 on some global Hermitean 

| J/1 -1/2 | ' 



Y can easily be quantified by the ratio i* 71 ^* 72 , , where yt — <j>i(Y) and of 



MY 2 )~MY) 2 , (i = 1,2). 

These definitions allow us to apply lemma (0 in the situation of a measurement using a 
global pointer to distinguish ipi from ipj: ~M.*(ipi) and M*(-0j) are globally different. 

Corollary 7.1 4 Let 4>i,4>2 G y{2>) be K-globally different vector states, i.e. there is a n- 
global Hermitean Y G S> such that 4>\{Y) 7^ §2 (Y). Let yi.2 — <fii,2(Y) be the expectation 
ofY in 4>i t 2, and o\ 2 = Var^ 2 (Y) the variance. Let a, (3 £ C be such that \a\ 2 + \f3\ 2 = 1. 
Then, for every n-local A: 

| W>r + ph\A\a^ + Pfa) - (M 2 (^I<M + \(3\ 2 (HA\h))\ < 2 ™ + <Tl+ . a * \\A\\. 

m - 2/2 1 

And for every global observable Y' = -A? J2?=i Yj, with < y' '■ 

\{a<h + /W|a& + Ph) - (|a|Vil^l^i) + l/3| 2 (02|l"|^2))| < 2K , + ai+ ° 2 y'. 

m - 2/2 1 



4 Consider the example of a cloud of N partides, with positions x l and momentap 1 . These observables 
are not bounded, but since [a; l ,í>'] is, this is merely a technical problem which may be averted by, 
for example, a cut-off. The position of the cloud, X = jfYsi=i x ii ' s increasingly macroscopic for 
increasing N. However, the total momentum of the cloud, P = J2i=iPi, ls I li^^ 1 1 -global, irrespective of N. 
So since [X, P] = ih, it would seem that reduction on X, using P as pointer, is good because h is small, 
and not because N is large. This is not the case however: of importance is the ratio h/\pi — p%\, and 
typical momentum differences do grow as N increases. 
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Proof : 



Since 2|a||j9| < 1, all we have to do is apply lemma JJJ to (<fii \A\<f>2). For the 
first inequality, we write Y = jj Ylk=i ^t** Since at most n of the Y ik do not 
commute with A, we have 1 1 [Y, A}\\ = W^^L^, A}\\ < 2n\\A\ \ ^là^Chl < 
2n«;||A||. For the second inequality, we use that || , i^JII < 2nMy', and that 
it equals zero if i k ^ j t . We obtain ||[F,y]|| = || jpjj Efeïíf/i^ K^^iJII < 
TFm Et=ífíÍT M ' 5(i k ,ji)2KMy> < 2k V '. Of course ||Y'|| < y'. 

q.e.d. 

Bear in mind that k, for a typical pointer, will have vàlues in the order of 10~ 23 . For a per- 
fect measurement (<Ji,2 = 0) this means one may probe detailed information about billions 
and billions of àtoms simultaneously without running the slightest risk of encountering 
any lack of collapse. 

Furthermore, in the case of non-perfect measurement using a K-global pointer, as long 
as nu <C dl + (72, all experimentally observed coherence may be attributed to the poor 
quality of measurement. All of this constitutes a second answer to question QJ: 

Answer 2 When using a very global pointer, the coherence remaining after 
measurement can neither be detected on very global observables, nor on very 
local ones. 
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2.3 Global Information Leakage 



So suppose we are in the circumstance of a repeatable measurement with the help of 
a very global 'pointer', say an actual pointer. We have seen that observables on which 
no collapse occurs are present on the combined system, but they can neither be entirely 
inside the measurement-apparatus, nor entirely in the atom. They can also not be very 
global, nor very local. All in all, they are pretty weird indeed. Yet in principle, they do 
exist and they can be measured by a second observer. 

2.3.1 Information Leakage After Measurement 

One should be extremely careful with this kind of reasoning, however. Imagine, for 
example, that M% represents a two-level atom, and describes some large measuring 
apparatus, measuring eigenstates of a z ® I with pointer I®Y which represents, literally, 
the position of a pointer. (The average position of all àtoms in the pointer is of course 
rather global.) Then as soon as 1® Y is measured (with the help of an ancillary system, 
e.g. light reflecting on the pointer and reaching our eyes), collapse on the combined 
atom-apparatus system takes place. It is of course immaterial whether or not someone 
is actually looking at the photons. If even the smallest speck of light were to fall on the 
pointer, the information about the pointer position would already be encoded in the light, 
causing full collapse on the atom-apparatus system. 

So as soon as the information about the pointer position has reached our eyes, we can 
be mathematically certain that collapse on the atom-apparatus-system has taken place, 
even on those extraordinary observables that commute poorly with the pointer of the 
measurement apparatus. However, if no such information has reached our eyes, we may 
still be practically sure that all measurement on the combined atom-apparatus will reveal 
that collapse has taken place, unless extreme measures (e.g. shielding, extreme cooling) 
have been taken to prevent pointer-information from leaking out of the system. A similar 
line of reasoning may provide a third answer to question (QJ , again due to lemma {JJ : 

Answer 3 // information leaks from the pointer into the outside world, col- 
lapse unavoidably takes place on the combination of system and measurement 
apparatus. In practice, global pointers constantly leak information. 

2.3.2 Information Leakage in General 

A Macroscopic system may be modelled by some local àlgebra £P = <8)j=i If this 
system is interacting normally with the outside world, (the occasional photon happens to 
scatter on it, for instance) then one may imagine a number of very global observables being 
measured continually, with a certain measurement uncertainty 5 o. For a decent definition 
of this 'measurement uncertainty' or 'quality' a, I will have to refer to chapterEl definitions 
lli lli and 11211 . But it entails the o~\ and 02 of lemma {JJ being < a for eigenstates of X . 



5 According to proposition 1151 . simultaneous perfect measurement is not possible, but global observ- 
ables normally commute well enough for the limits of accuracy imposed by proposition 1161 to remain 
well below the macroscopic scale. 
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This enables us to apply lemma @ . It telis us that all coherences between eigenstates 
ip xi and ip X2 of global observables X are continually vanishing on all of @, (the pointer, 
e.g. a beam of light, is outside the system), provided their eigenvalues x% and X2 satisfy 
\xi - xz\ > 2cr. 

Take for example a collection of N spins, @ = ® i=1 M2. Suppose that for a = 
x, y, z, the observables S a — J^iLi °a are continually being measured with an accuracy 
iV _1 <C o -C 1. For N in the order of Avogadro's number, N ~ 6 x 10 23 , this allows for 
extremely accurate measurement. Then between globally different eigenstates of S a , i.e. 
states for which the eigenvalues \s a ~ § a \ ^> a, coherences are constantly disappearing. 
However, the measurement need not have any effect 6 on states which only differ on a 
small scale. Take for instance p ® |+) and p ® |— ), with p some state on N — 1 spins. 
Indeed, \s a — s a \ < 2/N <C cr, so lemma {JJ is vacuous in this case. 

We see how a produces a smooth boundary between the macroscopic and the micro- 
scopic world: global processes (involving S' a -differences ^> a) continually suffer from loss 
of coherence, while local processes (involving «S' Q -differences <C cr) are unaffected. 



6 Of course it is possible for a measurement to destroy coherence between p® |+) and p (g |— }. Just 
take any old measurement, and add a 'decoherence operation' by hand. The net result is a measurement 
destroying coherence. 
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2.4 Conclusion 



In order for a second observer to notice the lack of collapse after a first observer has 
measured X £ sé with a global pointer, he or she must do the following: 

- Keep the original system sé and the system SS, containing the first observer, from 
interacting with the outside world, for instance by shielding or extreme cooling. 

- Then perform a measurement on an observable X E sé % 2$ which does not com- 
mute with the global pointer: it cannot be in the original system sé, it cannot be 
very global and it cannot be very local. If the first measurement is repeatable, it 
cannot lie entirely in SS either. 

- The outcome of the first measurement must remain unknown to the second observer. 
Suppose that an observable in the second observer serves as a pointer to the first 
measurement. The net situation would then be a measurement of X S sé by an 
observer outside of sé ® 3$. A second measurement will then always show collapse 
on sé ® 3$. 

Under these circumstances, it cannot be excluded that coherences are experimentally 
detected by the second observer. 

Paradox 

All of this leaves us with one glaring paradox: surely the first observer, remembering the 
outcome of A-measurement, can perform A-measurement and observe that there is no 
collapse, let alone reduction, contradicting proposition 10? 

The answer is no. The first observer cannot simultaneously measure A and remember 
a value of A. We will elaborate this in proposition 12(ill . where we will have some more 
tools at our disposal. 
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Chapter 3 

Measurement Inequalities 



There are many different concepts of measurement. Up until now, we have investigated 
mappings 5P{sí} — ► 5^{sé ® S$) of the form M*(p) = cf(p® r) for some automorphism 
a of sé ® SS and r € 5?{0). But other points of view are possible. For instance, von 
Neumann (see |Neu| ) and Holevo (see [Holj) define measurement as an affine mapping 
from 5?{sé} to «5^(^(0)), the space of probability measures on the Borel er-algebra in 
SlcR. 

In order to cover all concepts of measurement at the same time, we will investigate 
completely positive operations. With their help, we will define perfect measurement 
and we will define unbiased measurement. We will then rigorously define the quality of 
unbiased measurement 1 . 

After this, the way will be cleared for quite general statements on the trade-off between 
measurement quality and the amount of disturbance 2 . 

3.1 Completely Positive Operations 

What do we expect from any physical operation T* : y{.etf) — > from one (quantum) 

probability space into another? We formulate three natural requirements (copied from 
jMÜ): 

- The stochastic equivalence principle is common to all interpretations of postulates 
and <|2J|. It states that a system that is in state p\ with probability Ai and in state 
P2 with probability A2 cannot be distinguished from a system in state Aipi + A 2 p 2 . 
Therefore, T* must be affine: for all < Ai, A 2 < 1 such that Ai + A 2 = 1, 

XiT*( Pl ) + A 2 T*(p 2 ) = T*(Àipi + A 2P2 ) 

lr The attentive reader may have noticed the grotesque ugliness of proposition 0. This results from 
the ad hoc use of |M*(p)(Q) — p(P)| as a measure of quality. 

2 Although propositions and proofs will be different from the ones encountered before, their interpre- 
tation will be similar if not the same. In order not to disturb the flow of reasoning, l've chosen to once 
again go over details exhaustively mentioned before. My apologies to the reader. 
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T* can thus be extended to a linear mapping between the full duals of sí and 33, 
so that T* is the dual of a linear map T : 3è — > jé ' . This justifies our notation: 
T*(p) = poT. 

- In order for T* to map states to states, it must respect normalization and positiv- 
ity: T*(»(I) = 1 and T*{p)(B^B) > V B e 3è, pe S?{sé). Equivalently, 
T(I) = I and B > => T(B) > 0. 

- So T* is linear, normalized and positive. But it was realized by K. Krauss in the 
1970's that it must be possible to couple sí and 3è to another system ^ and perform 
the operation T* on sí, leaving c € untouched. This leads us to the last requirement 
(see |Kraj ). 

An operation T* is called n-positive if the map id*<g)T* : 5?{M n ®3g) -> S?{M n ®8Í) 
defined by r ® p i— > r ® (T*(p)) is linear, normalized and positive. An operation is 
called completely positive if it is n-positive for all n G N. 

There exist positive operations which are not completely positive. Formulating the above 
in the Heisenberg picture, we define a linear, continuous map id„<g>T : M n ®3$ — > M n ®sí 
by id„<g> T(A <g> B) = A® T(B) for all A € M n and Bel 

Definition 6 (Complete Positivity) Let sí and 3§ he unital C*-algebras. A linear 
map 33 sí is called completely positive if 3 T(I) = I and if for all D £ M n ® 38 ', n £ N: 

D>0 => id„(g)T(D) > 0. 

The class of completely positive operations was invented to encompass every physical 
operation you could ever want. For example, it contains all automorphisms, *-homo- 
morphisms and states, as well as dilations to automorphisms. A positive operation from or 
to an abelian àlgebra is automatically completely positive, and it hardly needs mentioning 
that both concepts of measurement mentioned above are completely positive too. 

We will proceed to investigate completely positive operations. It is surprising how 
much can be said about so general an object. 



3 We assume all completely positive operations to be automatically unital: T(I) = I. This is not always 
so in the literature. 
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3.2 A Cauchy-Schwarz Inequality 

If sí ', 88 are C*-algebras, one can define a sesquilinear map Ft : 88 x 88 —> sé ' . 

Deflnition 7 Let T : 83 — > sí be a 4-positive unital operation. Let A,B € 88. Then 

F T (4 B) = T(^B) - T(^)T(B). 

If no confusion is possible, we will often omit the subscript. F is a sesquilinear positive 
semidefinite sí-valued form on 88, i.e. 

- F is linear in the second argument, anti- linear in the first. 

- F(A, B)t = F(B, A) for all A,B&88. 

- F(B, B) > as an operator inequality for all B £ 8§. 

The first and second point follow immediately from T(Ay = T(A'). We will derive this 
and the third point shortly, along with an sí-valued Cauchy-Schwarz-inequality. There is 
also a fourth point of interest, clear from the definition: 

- F(I, B) = F(B, I) = VBef 

The likeness of F to an inner product incites us to introduce a semi-norm on 8$: 

Definition 8 (T-norm) Let T : 88 — > sí be a 4-positive unital operation. Let B £ 88. 
Then 

||B||t^HVFt(B,B)||. 

||-B||t is called the T-norm of B. Since T is always a contraction, we have ||-B||t < 
Of course it is possible that ||-B||t = for B / 0. If, for example, T happens to be a 
C*-homomorphism, then Ft is identically zero and ||-B||t = for all B £ 8è. Ft is in 
many ways a measure of how well T respects multiplication. 

Real and Imaginary Part 

Like any element of sí we can split Ft(A,B) into a Hermitean and an anti-Hermitean 
part: 

F T (A B) = 3?F T (A, B) + &F T (A, B) 

with 

23fJF T (A,B) = T^B + BU) - (T(A)tT(S) + T(S) f T(A)) 
2i%F T (A,B) = T(^B - BU) - (T(i)*T(B) - T(B)*T(i)) . 

The commutator of A and B is defined by [A, B] = f AB — BA. The anti-commutator by 
{A, B} + = AB + BA. In case of Hermitean A and B the above boils down to: 

23RF T (A,B) = T({A,B}+) - {T(A),T(B)}+ 
2i$sF T (A,B) = T([A,B\) - [T(A),T(B)\. 

On Hermitean A and B, ïïFt(A B) indicates how well T respects the anti-commutator, 
while 9Ft(^4, B) indicates how well it respects the commutator. (SFt is identically zero 
on the Hermiteans if and only if T is a Lie-algebra homomorphism.) 
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A Cauchy-Schwarz Inequality 

This sesquilinear, positive semidefinite Ft allows for an s^-valued Cauchy-Schwarz 4 in- 
equality: 

Lemma 11 (C*-Cauchy-Schwarz inequality) Let T be a four-positive unital opera- 
tion SS — > si . Then for all A, B S 2$ , we have in operator ordering: 

F T (A,B)F T (B,A) < \\F T (B,B)\\F T (A,A). 

Proof : 

First we prové that T(A)^ = T(A^) and that F(A, A) > VA e SS- Since T 
is two-positive on id2 ® T is positive on M 2 ® 3$, and we see that 

id 2 ^T(( f \ ll>0 



or 

T(AU) T(At) 
T(A) I 



> 



in the operator ordering. In particular it must be Hermitean so that T(A)^ = 
T(A t ). For each X > 0, also Y^XY > for any Y. Making a convenient 
choice for Y : 

I 
—T(A) 



Y 

we obtain 



T(A*A) - T(A)tT(A) 




> 



so that F(A, A) > for all two-positive T. Now since T is four-positive on J?, 
id 2 <S> T is again two-positive on M 2 ® Consequently, making a convenient 
choice of 'A' in M 2 ^: 

FufagT^Q o)'(o >>^° V ABe 
Working out this expression explicitly: 

Fid2OT (( )'( 

., // \ / A B 

= ld ^ T (( 5 t ){ 

id 2 «T[[ £ !lk^ M B 



fit / / 11 



4 Reminiscent of the Cauchy-Schwarz inequality for Hilbert C*-modules (see |La,n| V but not quite the 
same: the result is identical, but the conditions differ. 
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T(B^A) T(B^B) ) \ T(Bt)T(A) T(fít)T(B) 

F(A, A) F(A, B) 
F(B, A) F(B, B) 

> 0. 

Once again using X > => Y^XY > 0, this time with 



Y = 



I 
-F(B,A) 



we obtain, proceeding as above: 

F(A, A) - 2F(A, B)F(B, A) + F(A, B)F(B, B)F(B, A) > 0. (3.1) 

In the case that F(B, B) = 0, we need to prové that F(A, B)F{B, A) = 0. 
Now F(NB, NB) = for N E N. Applying inequality Ò to A and NB, we 
see that 

N 2 F(A,B)F(B,A) < F(A,A) 

for all N, so that F(A,B)F(B,A) = 0. In the case that F(B,B) ^ 0, we 
rephrase ineauaritv l3.1l as 

F(A, B)F(B, A) < F(A, A) + F(A, B)(F(B, B) - l)F{B, A). 



So, putting B' = B/\\ y/F(B,B)\\ and noting F(B', B')-I< 0: 

^m^B)^ ~ F{A A) + F{A B '^ B '' B 'ï - l ) Y (B',A) < F(A, A) 
yielding the required expression. 

q.e.d. 



Covariance and Uncertainty 

We shall give a few easy corollaries to clarify the nature of F. First of all, it resembles 
the covariance of a state. Classically, a probability distribution P on a probability space 
(0, 93, P) induces a covariance on pairs of random variables o, b: 

Cov P (a, b) = Ep(ab) - Ep(a)Ep(b). (3.2) 

Where Ep is the expectation with respect to P. In a quantum probability space, observ- 
ables are not represented by random variables, but by Hermitean elements of a C*-algebra. 
If one chooses to represent the product observable of A and B by (AB + BA)/2, one can 
generalize ll.'i.2ll to arbitrary C*-algebras. 

Definition 9 (Covariance) Let p S S?{9$). Let A, B S 9S Hermitean. Then the co- 
variance of A and B in p is defined by: 

cov p (A B) = p{\{AB + BA)) - p(A)p(B) . 
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Of course there is no conflict with proposition (01 for commuting A and B. From the 
C*-Cauchy-Schwarz inequality, we now have two easy corollaries. The first is a Standard 
result, known as the 'covariance inequality': 

Corollary 11.1 (Covariance Inequality) Let p G ^{0), Then for all Hermitean 
A,B G 3§: 

|cov p (A,5)| 2 < var p (A)var p (B). 

The second Standard result is known as the 'Heisenberg uncertainty relation': 

Corollary 11.2 (Heisenberg Inequality) Let p G S8(3S). Then for all Hermitean 
A,B G 3§: 

i 



{^f)\ <var p (A)var p (i?). 



In particular, if A = ihd x and B = x, It follows 5 that ua^b > h/2, We prové both 
corollaries at the same time: 
Proof : 

A state p on 3? is just a completely positive map 3S — > C. So we can form F p 
and note that 

3fEF p (A B) = \ (F P (A, B) + F p (B, A)) = cov p (A, B), 



ZF p (A,B) = ±(F p (A,B)-F p (B,A))=p 



[A,B] 
2i 



Therefore, both \cov p (A, B)\ 2 < \F p {A,B)\ 2 and \p{ [ A^)\ 2 < \F p {A,B)\ 2 . 
The two corollaries above now follow from the C*-Cauchy-Schwarz inequality: 

\F P {A,B)\ 2 < \F p {A,A)\\F p (B,B)\=v a x p {A)var p {B). 

q.e.d. 



In words, corollary Ijll.ljl is the real part of the C*-Cauchy-Schwarz inequality, corol- 
lary H11.2fl its imaginary part. 



Multiplication Theorems 

Up to this point, we've used a state p to construct F p , but we will encounter F descen- 
dant from more general positive operations 38 — > sí later on. The C*-Cauchy-Schwarz 
inequality was inspired by a 'multiplication theorem' due to R. Werner (see |Wer| ): 

Corollary 11.3 (Multiplication Theorem) Let T he a four-positive unital operation 
38 -> sí. Let Be 38 such that \\B\\ T = 0. Then for all A G 38: 

F T (A,B) = F T (B,A) = 

i.e. T(A^B) = T(A) f T(S) and T(B^A) = T^T^). 

5 Since d x and x are not bounded, we are not allowed to apply the C*-Cauchy-Schwarz inequality 
directly. The statement is true nonetheless. 
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The proof is immediate from the following generalization, the 'almost multiplication the- 
orem'. 

Corollary 11.4 Let T be a four-positive unital operation 38 —* srf . Let B G 38. Then 
for all A g 38: 

||Ft(A,B)||<||A||||B|| t . 

Proof: 

For any A, B g 38 we have by the C*-Cauchy-Schwarz inequality Ft(A B)Ft(B, A) < 
F T (A,A)||B|j2,,socertamly ||F T (A,B)|| < ||A|| t ||B||t- But since < F(A, A) < 
T(AU) < ||A|| 2 I, we also have ||A|| T < \\A\\. The corollary follows. 

q.e.d. 

This is the form of the C*-Cauchy-Schwarz inequality we will utilize most often. 
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3.3 Quantum Measurement 



With the help of the C*-Cauchy-Schwarz inequality we will investigate quantum measure- 
ment. But we will first define it. 

3.3.1 Introduction 

We will distinguish perfect and unbiased measurement. The former is a special case of 
the latter. 

Perfect Measurement 

In order for an operation to be a measurement, it must transport information from X 7 
the observable to be measured, to Y, the pointer-observable. Observation of Y in state 
M* (p) must be equivalent to observation of X in state p: 

Deflnition 10 (Perfect Measurement) Let X £ sí ' , Y £ SS be Hermitean. A perfect 
measurement of X with pointer Y is by deflnition a completely positive map M : §§ — > sé 
such that 

In the Heisenberg picture, this makes M^y) an injective *-homomorphism: 

Proposition 12 Let M : S§ — > sí be completely positive. Let X € sí , Y G SS be 
Hermitean. Then M is a perfect measurement of X with pointer Y if and only if 

Spec(X) = Spec(Y) and M(/(F)) = f(X) V/ € ^(Spec(F)). 

Proof : 

If f p ,x = Pm*(p),y, then they certainly live on the same measure-space: 
Spec(X) = Spec(F). By the proof of proposition flj, V P: x = Pm*( p ),y 
iff their expectation vàlues on / € ^ (Spec(y)) are the same. This is so for 
all p € y(sí) iff p(M(/(y))) = p(f(X)) \/p e .y{s/), or equivalently, iff 
M(/(Y)) = f(X). 

q.e.d. 

In particular, a pointer Y measures only one X = M.(Y). 
Unbiased Measurement 

We shall broaden our view to include 'measurements' that do not transfer the entire 
probability distribution from X to Y, but only the average. 

Deflnition 11 (Unbiased Measurement) Let X e sí, Y € $è be Hermitean. An 
unbiased measurement M of X with pointer Y is by deflnition a completely positive map 
M : 38 -> sí such that 

M*(p)(Y) = p(X) \/pey{sí) or equivalently M(Y) = X. 
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Equivalence is easily established with (K&R, p. 257]. Observing Y in state M*(p) results 
in the same average as observing X in state p. But the probability distributions need not 
be the same. 

Take any operation T : 3§ — > srf . Take any Y 6 £3. Then T is automatically a 
measurement of T(Y) with pointer Y. Unbiased measurements are not hard to find. 

Quality of Unbiased Measurement 

Next, we define the quality cr of this unbiased measurement: 

Deflnition 12 (Quality) Let M : SS — > íé be an unbiased 6 measurement of X with 
pointer Y . Then its quality a is defined by 

o 2 d = f sup{ var M , (rt (r) - var p (X) \ p G 5?{sé) }. 

So a telis us how much the uncertainty of the measurement result maximally exceeds the 
unavoidable amount of uncertainty inherent in the state p. In particular, when measuring 
an eigenstate of X, the variance in the measurement outcome will be less than or equal 
to cr 2 . It is intuitively clear (and we will prové shortly) that a 2 > 0: the uncertainty of p 
in X is inherent and can never be diminished by some clever choice of measurement. 

Lemma 13 The quality a of an unbiased measurement M : SS — > sé with Hermitean 
pointer Y g S§ is ||^||m. 

Proof : 

If M is a measurement of X, then X equals M.(Y). This is apparent from 
p(M(Y) -X) = Q for all p € S?{sf). Now for all p £ y(sé) 

var M . (p) (r)-var p (M(r)) = 
= (m*(p)(Y 2 )-(M*( P ))(Y) 2 ) 
= p(M(Y 2 )) - P (M(Y) 2 ) 
= P(F(Y,Y)) 

so that 

a 2 = sup{ p(F(Y, Y))\pe .YÇs/) } = \\Y\\ 2 m . 
This proves the assertion, as well as the positivity of cr 2 . 

q.e.d. 

From this, it is clear that if a measurement is perfect (in the sense of definition 
then it has optimal quality: a = 0. One need only apply proposition Ü12II with f(x) — x 2 . 
It is time to look at some examples of unbiased measurement. 

6 In order to claim complete generality, we could abandon the demand that a measurement be unbiased: 
we would then introduce a maximal bias e = sup{ |M* (p)(Y) — p(X)\ \ p 6 = ||M(Y) — X\\. 

This would make any completely positive operation M : 38 — > s/ a measurement of any X £ srf with any 
pointer Y í SS and with a certain quality a and maximal bias e. The interested reader may consider it a 
home exercise to adapt the estimates to come, adding e's along the way. Good luck. 



-(p(M(Y) 2 )-p(M(Y)) 2 ) 
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3.3.2 Examples of Unbiased Measurement 

1. A direct 7 observation of X £ sé, denoted M : "íf (Spec(A)) — > sé is defined by 
/ i ► f{X). In the dual (Schròdinger) picture, M* maps S'{sé) to J^(%í(Spec(A))), 
the probability distributions on Spec(A). It is completely positive. According to 
proposition QJ, M*(p) = P p ,x- If y in ^(Spec(A)) is the random variable y(A) = A, 
then M is a perfect measurement of X with pointer y. Its quality is therefore a = 0. 

2. Indirect observation is also measurement: If M* : S?{sé) — > 5?{sé ® 88) is defined 
by M*(p) = a*(/3(g)r) for some automorphism q of .2/ ® 88 and for some r £ 5?{38), 
then it is completely positive. If M*(p)(Y) = p(X) for all p £ y{£?), then it 
is an unbiased measurement of X with pointer Y. It is perfect in the sense of 
definition ifÏQj) if and only if it is perfect in the sense of nagelTTl 

3. Each automorphism is completely positive. If a is an automorphism of £F such that 
a(Y) = X, then it is a perfect measurement of X with pointer Y. Its quality is 
automatically a — 0. 

4. Let U C K be compact in the Euclidean topology, and let sé be some von Neumann 
àlgebra. A 'Positive Operator Valued Measure' (POVM) (see poll p. 51]) is defined 8 
as a mapping M from the Borel-measurable subsets of U into sé satisfying: 

- M(0) = 0, M(U) = 1 

- M(V) > as an operator inequality for all measurable V C U. 

- Suppose { Vj | j £ J } is a countable decomposition of V , then M(V) — 
J2je.j M(Vj), where the sum converges in the weak sense. 

By integrating bounded measurable functions y°°(U) on U over the POVM, M 
may be extended to a unital, positive operation M : Jz?°°(í7) — > sé. In short: 
M(/)= / f(x)M(dx). It is completely positive due to the commutativity of j£f°°(í7). 
Thus, we have an unbiased measurement of / xM(dx) with pointer / : x i— > x and 
quality cr 2 = || f x 2 M(dx) - (J xM(dx)) 2 )\\. The POVM is projection-valued iff 
cr = 0. It then reduces to the direct observation of example (QJ. 

5. Davies (see |Davl ch. 3]) also adopts the POVM as measurement. He gives a par- 
ticularly nice example of an unbiased position measurement: Let M 1 = L 2 (M.), and 
sé = 8§(Jéif). Then the position observable X is defined by {Xip){x) = xip(x). 
X has spectral measure V t— > P(V), defined by (P(V)ip)(x) = lv(x)ip(x). 

Let / be a probability density on K with zero mean. For Borel-sets Fel, M(V) is 
defined 9 by M(V) = f_ (f *Iv)(x)P(dx) . f has the effect of blurring the outcome. 
In the limiting case that / is the Dirac <5-distribution on zero, M(V) = P{V). 

M turns out to be a measurement of the position observable X indeed: 

J xM(dx) = X. Furthermore, Davies shows that the quality of M is exactly the 

r.m.s. value of /: a 2 = Var(/). 

7 This kind of measurement is often called 'von Neumann measurement'. We will reserve this designa- 
tion for example ISlinstead. 

8 In IDavI and IHoll even for non-compact U. 

9 / * g is the convolution of / and g. Explicitly, (/ * g){x) = f^° x f(y)g(x - y)dy. 
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6. Suppose that the observable being measured has discrete spectrum, X = ^VajjPj, 
Then there exists a so-called von Neumann measurement N : ^(Spec(X)) ® sé — > sí . 
It is defined by f®A^> J2i f(i)'PiAP l . Constrained to ^(Spec(X)) (g)I, it reduces 
again to direct observation. But N also gives information about how the system is 
left behind. 

N is a perfect measurement of X with pointer p ® I (where = Xi) and quality 
ít = 0. The state N*(p)|^ ( g ) ][ is exactly the collapsed state of p after X-measurement: 
N*( P )(A ® I) = P Ç£. P.AP,) = C*(p)(A), 

Note that if [A, X] = for some A 6 jz/, then A = P^AP^ since it also commutes 
with the spectral projections. Consequently, N(A(g)l) = A and N*(p)(A<g)I) = p(A). 

7. Let M : M 2 ® C 2 — > M 2 (C 2 are the 2 x 2-diagonal matrices) be defined by 

M{A®D) = duXjAXi 

i=0,l 

with the matrices (0 < e < 1/2) 



\/i J V VÏ^e 

M 2 ® C 2 is isomorphic to M 2 © M 2 . There M reads 

M(A @B) = X^AXo + X\BX X 

From which one can verify complete positivity of M. It is an unbiased measurement 

of a z with pointer (1 — 2e) -1 I<g) diag( 1, —1 ) and quality a ~ -^5r^, For e = 0, 
M reduces to the (perfect) von Neumann measurement of exampleEI For e = 1/2, 
it has become completely useless: It produces a random outcome on C 2 , unrelated 
to the measured object. We'll come back to this example after proposition l|19JI , 

8. There are also silly examples of measurement: The identity is a perfect (a = 0) 
measurement of any observable X with pointer X. 

9. Any completely positive operation is a perfect (er = 0) measurement of I with 
pointer I. 

Definition 11 lli even seems to admit operations one would not call measurement. In spite 
of its generality though, much can be said about unbiased measurement and its quality. 
We shall give some examples of this, leaning heavily on lemmas l!13ll and fiU - 



3.3.3 Structure of a Perfect Measurement 

But before that, we will exploit the fact that any completely positive operation M acts 
as a homomorphism on the elements of M-norm 0. For any Hermitean H we will denote 
by ^(H) the C*-algebra generated by I and H. 
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Lemma 14 Let M : 3$ — ► srf be a 4-positive operation, let B £ 3§ be Hermitean. Among 

1. ||B||m=0. 

2. M is an isomorphism tf(B) -> <#(M(B)), 

3. Spec(B) = Spec(M(B)) and M(f(B)) = f(M(B)) for all f £ (Spee(B)) . 
I \\f(B)\\ M = for all f € ^ (Spec(B)) . 

5. M maps the relative commutant B' into M(-B)'. 
The following relations hold: 

Proof : 

O =** © : ^{B) is the norm-closure in ^ of the collection of polynomials in B. 
Similarly, <r ^'(M(_B)) is the norm-closure in sé of the polynomials in M(_B). By 
the multiplication theorem HIOI) . we see that M.(B n ) = M(£?)". From this and 
linearity, one verifies that M(p(£>)) = p(M.(B)) and ||j>(-E>)||m = for all polynomi- 
als p. 

Now let Q € ^{B). By the Weierstrass theorem, there exist polynomials p n such 
that Pn(B) — ► Q in norm. Since M is automatically norm-continuous and since it 
maps tfÇB) densely into ^(M(B)), we can verify that M(Q) g <g(M(B)): 

M(Q) = Hm M(p„(B)) = Hm Pn (M(B)) e íf(M(B)). 

Similarly, ||Q||m = since 

M(Q 2 ) = M( lim pl(B)) = hm M(p£(B)) = lim pl(M(B)) 

= (jdm p„(M(B))) 2 = (M(g)) 2 . 

This means that the restriction of M to ^(B) is a C*-isomorphism: if Q S ^{B), 
then even M(QA) = M(Q)M(A) for any i € ^ by the multiplication theo- 
rem Its image is therefore automatically norm- closed (see (K&R, p. 242]), 
and thus equal to ^(M(B)). 

(|2j) => © : By the canonical isomorphism / i— > /(-B) known as the Gel'fand trans- 
form, ^{B) is isomorphic to ^ (Spec(B)), the C*-algebra of continuous functions 
on Spec(B) equipped with the supremum norm (see }KERl p. 271]). Similarly, 
c é' (M(B)) ~ "íf (Spec(M(B))). M acts as a continuous isomorphism mapping poly- 
nomials p on Spec(_B) to the same p on Spec(M(£>)) since M(p(£?)) = p(M.(B)). 
By continuity of M.\^m) an d '^■~ 1 W('M(B)), polynomials converge on Spec(B) if 
and only if they do on Spec(M(B)). Since the spectra are closed they must be the 
same, and by continuity of M and the theorem of Stone- Weierstrass (the polyno- 
mials form a norm-dense set in the space of continuous functions) , we now see that 
each continuous function is mapped to itself. 
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® => ® : Let / G ^(Spec(S)). Let = /(x) 2 . Then by © we have M(f(B) 2 ) = 
M(g(B)) = g(M(B)) = f(M(B)) 2 = M(/(i?)) 2 . 



Q Q : Trivial: choose f(x) = x. 

QJ) © : Suppose that A G i?', i. e. [A, B] = 0. Then by the multiplication theo- 
rem IllOjl . 

[M(B),M(A)} = M([A, B]) - [M(A),M(B)] = 2iF m (A, B) = 0. 

q.e.d. 

It is clear that perfect measurement (in the sense of definition IjlOJl l satisfies a — 0. But 
combining lemma ifTljl with proposition ltÏ2ll . we also obtain the converse: 

Corollary 14.1 Unbiased measurement (in the sense of definition ) is perfect (in the 
sense of definition llfl\) ) if and only if it has quality a = 0. 

Which is the moral obligation of any definition of quality. In the particular case of von 
Neumann algebras: 

Corollary 14.2 Let sí and be von Neumann algebras, and A G sí , Hermitean. 
Suppose M : 98 — > sí measures X with pointer Y and quality a — 0. Then M is also 
a perfect measurement of all spectral projections P(V) of X , with pointer Q(V), the 
corresponding spectral projection ofY. 

Proof : 

For any Borel set V and for any p G Y{sí), we have seen that F p ,x{V) — 
^M'( p ),y(V)- In particular, for all normal states p, this means that p(P(V)) = 
M*(p)(Q(V)), or p(P(V) - M(Q(V))) = for all normal states p. There- 
fore M(Q(V)) = P(V). Automatically, \\Q(V)\\ M = 0, since M(Q 2 (V)) = 
M(Q(F)) =P(V) = P 2 (V). 

q.e.d. 

I hope that the paragraph above has given some credibility to our definitions of mea- 
surement and quality. They will form the basis of the rest of this thesis. 

3.3.4 Simultaneous Measurement 

As an appetizer, we'll use the C*-Cauchy-Schwarz inequality to generalize a well-known 
theorem. Perfect simultaneous measurement (i.e. measurement of two observables using 
two commuting pointers) can only be performed on commuting observables. The math- 
ematical formulation below is based on Werner (see |2<fer|), but the physical statement 
was known long before, see e.g. |Nen| . 
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Proposition 15 (Joint Measurement) Let M : SS — > »e/ be a perfect (<jy = ||Y||m = 0, 
°y — II^IIm = 0) measurement of both X € sé and X € sé with commuting pointers 
Y,Y e SS respectively. (All Hermitean.) Then 

[X, X] = 0. 

For example, let M* be an affine map from y(sé) to the space of probability distributions 
on Spec(JÍ) x Spec(X) such that P p ,x and P p ^ are the marginal probability distributions 
of M*(p). By the proof of proposition the space of probability distributions on a 
probability space fí can be identified with ■y( ( é'(fl)), the state-space of the C*-algebra 
^(íi). Due to the abelianness of ^(íl), M* must be the dual of a completely positive 
map. It is therefore a joint measurement in the sense of proposition l(Ï5|l . We see that the 
kind of mapping constructed in proposition @, vital to the interpretation of quantum 
mechanics, simply does not exist if [X, X] ^ 0. 

We'll prové proposition II 1511 along with a Heisenberg relation-like generalization 10 , 
relating the product of both measurement qualities with the lack of commutativity. 

Proposition 16 (Generalized Joint Measurement) Let M : SS — » sé be an unbiased 
measurement of X S sé and X 6 sé , both Hermitean, with commuting Hermitean pointers 
Y,Y £ £§ respectively. Then for the qualities oy = ||Y||m o,nd a Y = ||Y||m the following 
relation holds: 

2a Y a Y > \\[X,X}\\ . 

Proof: 

\\[X,X}\\ = ||M([y,y])-[M(Y),M(y)]|| = ||23(F(Y,y))|| < 2||Y|| m ||Y||m = 
2aya Y , proving both propositions 11511 and llllit . 

q.e.d. 



10 |Hnll p. 90] already gives a generalization for POVM's. However, this involves only Var M » ( p ) (Y) 
instead of Var M *( p )CK) — Var p (X), thus staying much closer to the Heisenberg uncertainty relations. 
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3.4 The Heisenberg Principle 

The so-called Heisenberg principle 11 may be formulated as follows: 

When an outside observer extracts quantum-information from a system, it is 
impossible to leave all states unaltered. 

As for a mathematical formulation and proof, due to R. Werner (see |War| ): 

Proposition 17 (Heisenberg Principle) Let M : sé ® 3è — > sé be an unbiased mea- 
surement of any Hermitean with any Hermitean pointer I <B> Y £ I (g> Sè. Suppose 

that M leaves states on sé undisturbed: M*{p){A ® I) = p{A) VA e sé V/5 € 5? {sé). 
Then X is in the centre of sé. 

Proof: 

In circumstances above, M(A (8 1) = A for all A 6 sé. Therefore also 
M(At A ® I) = AU for all A e -ef. Thus ||A <g> I||^ = 0, which entails 
[M(A <g> I), M(I <g> F)] = 0, i.e. [A, X] = for all A in sé. 

q.e.d. 

So the only information (of any quality) that can be obtained without disturbing the 
original system is information about central elements. 

We have good reason to consider 'central information' as 'classical information': In the 
fully classical case, the àlgebra sé is abelian. All observables are central, so all information 
is freely accessible. In the archetypal quantum case however, the àlgebra sé in question is 
£${<•%?), the àlgebra of bounded linear operators on some Hilbert space Jíf. In this case, 
I is the only central element (modulo C), so that no information can be gained without 
disturbing the system. 

3.4.1 Global Generalization 

So we have grounds to examine the norm distance of X to the centre, d(X, 3?) = 
inf{ || X — Z\\ | Z S One may think of d(X, 3?) as quantifying the amount of 'quan- 

tumness' in X , as it determines the maximal amount of non-commutativity with X in the 
sense below: 

Lemma 18 Let X be a Hermitean element of a finite- dimensionat von Neumann àlgebra sé . 
Then d(X, 2f) is the smallest number c such that || [A, X] || < 2c || A|| for all A e sé. 

Proof: 

Finite or infinite dimensional àlgebra, it is clear that for any A € sé, \\ [A, X] \\ = 
|j [A, X — Z]\\ < 2||A||||A - Z\\ for any Z e 2? {sí). Taking the infimum over 
Z, we obtain ||[A, A]|| < 2d{X, 3?)\\A\\. This means that we are finished if we 
find A € sé, A jí such that \\[A,X]\\ = 2d{X, &)\\A\\. 

l·1 Not to be confused with the Heisenberg uncertainty relations in corollary Ifll. 21 . 
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Now for any von Neumann àlgebra sé, there exists a so-called decomposition 
over the centre (see |K&RI ch. 14]). For finite-dimensional von Neumann 
algebras, this simply means that sé is isomorphic to M ni 8 ... 8 M 1lk for 
some k;ni, . . .nk <E N, where M ni is the àlgebra of rn x m matrices. So 
X = Xi(B ■ ■ -(B Xk with Xk £ M nk . Each Xk can be brought in diagonal form 
by some unitary transformation Uk- So by the isomorphism U\ 8 . . . 8 [/&, we 
may think of X as 

diag{\x^),. . . , A (lini) ) 8 . . . 8 diag(X {k ,i), \k,n h )) 

with A( ij i) > . . . > A(í jn< ) the eigenvalues of Xi in decreasing order. Let 
n = | (A(j i i) — A(j inj )J be the spectral radius of Xj. Let íj = i [\i t i) + A(j, ni )). 
Let 

1 d = f X - (íil © ... © í fe I) 

= diag . .,A(i )7ll )) © ... © rfiací ^A( fe| i),. . • , A( fel „ fe )J 

with A(j .i) = Vi, \u >n A = — r,-. X and X differ only by the central element 
t\l © ... © ífel € Furthermore, ||X|| = max^ j-j. Take that maximal r, and 
isolate the highest and lowest eigenvectors 

Offi...ffiOffiV+,-ffiOffi...ffiO 

in C" 1 8 . . . 8 C n " . Construct 

A = © . . . 8 8 (l^+KV'-l + l^-XV'+l) © 8 ... 8 

in .e/. Now, since I-X£ 5°, we see that 

[X, A] = [X,A] = 8 .. . 8 8 2r l (|^+)(^_| - |^-)(^+|) 8 08 ... 8 0. 

So = \\[X,A]\\ = 2n = 2||X||||A||, But since already d(X,3T) < \\X\\ 

and \\[X,A]\\ < 2d(X,3r)\\A\\, we see that 

2d{X,&)\\A\\ < 2||X|||m| = \\[X,A]\\ < 2d(X, 3T)\\A\\. 

So ||jf|| =d(X t 2r) and \\[X,A]\\ = 2d(X, 3T)\\A\\. 

q.e.d. 

We are now looking for generalizations of proposition ifTTjl of the following form: suppose 
you allow some (small) disturbance of the states on the original àlgebra. How much 
information can be gained maximally? Note that in the proposition below, all that is 
used about the pointer is that it commutes with sé ® I. 
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Proposition 19 (Generalized Heisenberg Principle) Let sé be a finite- dimensiono! 
von Neumann àlgebra with centre 3? , Let 38 be an arbitrary von Neumann àlgebra and 
let Y £ 3§ , X £ sé Hermitean. Let M be completely positive sé ® 3$ — ► sé such ihat: 



M(I<g> Y) = X and ||I® Y|| M 
pointer I ® Y and quality cr. 



cr, i. e. M is an unbiased measurement of X with 
M*(p)\^ & - p\\ < A Vp e 5?{sé) for some < A < 1. 



Then 



a > d{x, gry 



/3A 



Proof : 



We move to the Heisenberg picture: ||M(A ® I) — A|| < A||j4|| because 
p(M(A (g) I) - A)< A||A|| for all p <E y(sé). For notational convenience, 
we introduce an operation T : sé — > sé defined by T(A) = M.(A ® I) and the 
map D : sé -> sé defined by T>{A) = T(A) - A. The former is the effect of 
measurement on the measured system sé, the latter satisfies ||D(A)|| < A||A||. 
Since T)(A) j; 'D(A) > 0, we may estimate 



\Ai 



2 

M 



= \Ft(AA)\\ 

< | F T (A,A) +D(A) t D(A)|| 

= |T(AU)-T( J 4 t )T(A)+D(yl) t D(y4)|| 

= | (D(AU) + AU) - (D(A) + A)t(D(A) + A) + D(A)tD(A)|| 

= | (D(^tyl) - D^U - ^D^H 

< 3A||A|| 2 . 



Since A < 1, T must be injective because it is linear and because 

T(A)=0 ||A|| = ||D(A)|| < A\\A\\ => ||A||=0. 

Since sé is finite-dimensional, this implies that T is onto. Furthermore, 
\\T(A) - A\\ < A\\A\\ implies |||T(A)|| - ||A||| < A||A|| and hence \\A\\ < 
||T(A)||/(1 — A). By the C*-Cauchy-Schwarz inequality, we deduce 

\\[X,T(A)}\\ = ||[M(l0Y),M(A®I)]|| 

< 2||(I®y)|| M ||(A®I)||M 

< 2cr\/3A||A|| 
/3A 



< 2a- 



1 



\T(A)\\. 



So, since T is onto, crf^ is anumber c such that \\[X, A]\\ < 2c\\A\\ \/A e sé . 



By lemma lfÏ8jl . d(X, Sf) is the smallest such number. Thus d(X, 2f) < &j_ A 

q.e.d 
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If A = 0, the last line of the proof reduces to proposition I|17J1 for finite dimensional 
algebras: unbiased measurement is only possible if X is central. 

If A 7^ 0, proposition l|19Jl says that a > d(X, ^)^^- given a non-central X € sé to 
be measured with maximal disturbance A. Then the attainable measurement quality a 
is worse than d{X,3T)^. We see that a becomes deplorable if A is lowered to zero. 

For example, let's look again at the unbiased measurement M : M2 ® C2 — > M%, dis- 
cussed in exampledon page0^1 Explicit calculation shows that M satisfies the conditions 

of proposition for A = 1 — 2^J e(í — e). It yields the estimate a > ^^J 3 g^/ ^ ) 

whereas the real quality of M equals 2 V^?, e ^ . In this case (and probably in general) the 
estimate is rather crude 12 . But it does contain some general features of the curve <r(e), 
notably lim e | 1 er(e) = 00 and lim e | er(e) = 0. 

3.4.2 Local Generalization 

We have extended the Heisenberg principle by demanding that all states on s/ are per- 
turbed in norm only slightly instead of not at all. Another way of 'extending' it is by 
demanding that all states are left exactly in place, but only with respect to some obser- 
vables: 

Proposition 20 (Generalized Heisenberg Principle) Let X £ sí ' , Y € SB be Her- 
mitean. Let M ; sé ® 2$ — » sé be an unbiased measurement of X with pointer I ® Y and 
quality a. Let ^ A 6 sé with \\[X,A]\\ = S\\A\\ be such that 

M*(p)(A®ï) =p(A) VpeJ^OO. 

Then 

a > S/2. 

Proof: 

Of course M(A ® I) = A. By the C*-Cauchy-Schwarz inequality, 

S\\A\\ = ||[X,^]|| ^|[M(I®Y),M(A®I)]-M([I®Y,A®I])|| 
= ||2i3?F M (]I® Y,A®I)|| < 2||I®y|| M ||A®I|| M < 2o-||A||. 

q.e.d. 

For example, let sé = (g) i=1 JVÍ2- Let a x ,cry,a z be the Pauli spin-matrices in M2, and 
denote by a l a the observable 1®...®!®^®!®...®!. Let X = jj J2i=i a x an d 
A = jj'^2!i=i a y be the average spin in the x- and y-directions. Then ||[X,yl]|| = -^||j4||, 
so that any measurement of X leaving A untouched automatically has quality a > -h, 
Accurate average spin measurement in all directions simultaneously is only possible in 
large systems. 

12 Note that the hideous V3A comes from the estimate \\A ® I||m < v3A|[A|[. This is the part of 
the proof where the crudeness comes in: In this particular example \\A ® I||m < (1 — 2e)||A|| \/A 6 sé. 
Taking the proof of proposition 1191 from there would yield the true a as an estimate. 
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3.5 State Reduction and Collapse 



Until now we have only looked upon the Heisenberg principle from one side: given a certain 
amount of disturbance, how much information can one gain from a system? On the flip 
side, we may consider the following question. Given a measurement of a certain quality, 
how does this perturb the system? In case of a perfect measurement (a = || V||m = 0), 
lemma itTUl and its corollaries give fairly detailed restrictions on the structure of M. For 
example, M has to map Y' into M(y)'. In the Schròdinger picture, this translates into 
a particularly nice answer to the above question known as 'state collapse'. 

3.5.1 State Reduction 

Let p be a state on sé ', and Ie/ In defmition <QJ, we have deflned the state px on sé 



Let sé and 3$ be von Neumann algebras, and let M : 3$ — > sé be a measurement of 
X G sé with pointer Y G SS and quality a = 0. Let P(V) be the spectral projections 
of X, Q(V) those of Y. In corollary (114. 2|) . we have seen that M measures perfectly 
the spectral projections of X with the corresponding ones of Y: M(Q(F)) = P(V) and 
||Q(10I|m = 0- Under these circumstances, and if both p and M are normal (i.e. weakly 
continuous), we have seen 2 examples of the above defmition: 

- We may look at the normal state M*(/j) resulting from measurement. An observer 
?3Y may condition all its observations on the pointer outcome V. On pageEl the 
conditioned probability is shown to be induced by the state (M* {p))q(v) : 



- If any perfect measurement of Y yields outcome in V, the system is experimentally 
known to be in state p-p(y) prior to measurement. This is called the reduced state. 

We prové proposition Q in the completely positive setting, showing that the commuting 
diagram on page remains vàlid for all completely positive perfect measurements M: 

Proposition 21 (Reduction) Let M ; £$ — > sé he a measurement of X G sé with 
pointer Y G SS and quality a = 0. Then for all states p G sé : 



by 



Px(A) 



p(X^AX) 

P {x\x)- 



([BmW] \[YinV\)= P(m-( p )) 



tB {[BmW}). 



(M*p) Y =M*(p x ). 
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Proof : 



Writing out the definitions, we need to prové that for all A € sé: 

p(M(Y^AY))p(Y^Y) = p(X*M(A)X)p(M(Y*Y)). 

Since ||F|| M = 0, M(Y+Y) = X^X. By the multiplication theorem ifTTÏÏjl . 
M(F^ AY) = X^M.(A)X . Letting p act on the above proves the assertion. 

q.e.d. 

Taking Q(V) for Y and P(V) for X in the above proposition, (M*(p))q(v) = M*(pp(v)): 
reducing a measured state according to pointer outcome and measuring a reduced state 
amounts to the same thing. In particular, for [B, Y] = 0, 

P M . Wl B,y {[B in W] | [Y in V}) = P ( m*(pp (V )),b (i B ^ W\ )■ 

If you measure X with pointer Y and register an outcome in V, then direct observation 
of all B S c € will be as if, prior to measurement, the system had been in the reduced 
state p-pn/). 

Of course this is also true for all indirect observations, as long as the outcome is not 
erased from the original pointer Y. If, after M*, an operation N* takes place, leaving the 
pointer Y and its projections untouched, then we may simply apply proposition l(2Ï|l to 
N* o M* instead of M*. This explains why state reduction p i— ► pp(v) is observed by 
as long as observes the outcome X in V indirectly. 

3.5.2 State Collapse 

Note the essential difference between conditioning on the outcome [X in V] and forming 
the X-reduced state: the former can only be done on X' , the latter on all of sé ' . Let 
X have spectral measure V i— » P(V). On X' , p is a harmless classical superposition of 
X-reduced states: 

=^ /) (P(F i )4) =^p(P(F»)iP(K)) =^ /) (P(7 i )) /3p(Vi) (4) \/A e X'. 
ii i 

As far as A £ X' is concerned, a system in state p is simply in state p-p(vú with probabi- 
lity p(P(Vi)). On page|Hl we have introduce the collapse operation C : sé — ► sé, defined 
by C(A) = J2i P (Vi) AP (^)- Then C*(p) is known as the collapsed state: C*(p) = 
EiP(P(Vi))p P (v t) . 

When you remove the restriction A G X' , p is not just the classical superposition of its 
reduced states. The difference between p and C*(p) on sé is experimentally 13 observable, 
e.g. by two-slit-experiments or along the lines of pageEI 

In summary, a state p is only a classical superposition of the X-reduced states p-pívA 
on X' , the commutant of X. But proposition l(2ÏTl shows that after measurement the 

13 Of course, reduction on classical (central) observables is the exception to the rule. For a central 
observable Z, the relative commutant Z' equals sé ', and any state may at any time be safely considered 
a classical superposition of Z-reduced states. 
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tables are turned: M*(p) is a classical superposition of the M*(pp(y 4 )) on Y' , not X' . In 
other words, M*(p) = M*oC*(p) on Y' . This can also be seen directly from the structure 
of M. 

Proposition 22 (Collapse) Let sé , SS be von Neumann algebras. Let M : SS — > sé be a 

perfect (a = 0) measurement of a Hermitean X £ sé with pointer Y G S$. Let V i— > P(V) 
be the spectral measure of X . Let {Vi \i G 1} be a countable decomposition ofSpec(X). 
Then a collapse operation C ; sé — > sé is defined by C(A) = f J^i P(Ví)AP(Ví) . In this 
situation, we have for any p G 5^ {sé): 

M*(p)=M*oC*(p) on Y'. 

Proof : 

M maps Y' into X' , and C leaves X' pointwise fixed. Therefore, if B G Y' , 
wehaveM*oC , (/})(B)=p(CoM(5)j = p(M(B)) =M*(p)(B). 

q.e.d. 

The above proposition states that when you measure X in state p, and then restrict 
attention to the commutant of the pointer, then the system will behave as if it had been 
in the collapsed state C*(p) prior to measurement. It generalizes the diagram on na,gelÏ5l 
to all completely positive perfect measurements. 

A measurement of X with pointer Y is called repeatable if immediate repetition of 
the measurement would yield the same result, i.e. if M*(p) = p on ^{X). Then ||X||m 
must be 14 0, so according to the definitions M is also a perfect measurement of X with 
pointer X. According to the above proposition then, M*(p) equals M*o C*(p) not only 
on Y' , but also on X' . 

For example, if M : sé ® 38 — > sé measures X with pointer I ® Y in a repeatable way, 
then the distinction between states that are collapsed or intact prior to measurement 
can, after measurement, neither be made by observables of the form A ® I nor of the 
form I £g) B. 

3.5.3 Generalized State Collapse 

Very well. For perfect (er = 0) measurements of X with pointer Y, the essence of state- 
collapse is that M maps Y' into X' . The way to think of X' is the following: in a 
finite-dimensional àlgebra, X can be decomposed into projections as X = ^ XíPí. And 
of course M(B) = PiM{B)P j for any B G Y', Now M(B) G X' means M{B) = 
X) í P(M(_B)Pí; then M.(B) contains only diagonal blocks, like 
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14 Since M(/(X)) = f(X) for all continuous / on Spec(JÍ), this is certainly true for f(x) = x 2 , whence 
F(X,X) = 0. 
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And of course this is just the disappearance of coherences between eigenstates of X with 
different eigenvalues. In this light, an approximate collapse proposition would have to be 
something that says how small the 'off-diagonal blocks' of M(S) get, provided that B 
commutes rather well with the pointer Y, and that the quality of measurement a is not 
too bad. In other words: a Heisenberg-equivalent of lemma 0. 

Proposition 23 (Generalized Collapse in the Heisenberg-Picture) Let srf be a 

von Neumann àlgebra. Let M : SS — > s/ be an unbiased measurement of X € s/ with 
pointer íef (boih Hermitean) and quality a . Let Spec(X) D V i— » P{V) denote the 
projection valued measure belonging to X. Suppose B G S$ is a Hermitean element such 
that \\[Y,B]\\ = S\\B\\. Then 



\\P([x,x + e])M(B)P([y,y + e] 



< 



2a 



f - y\ 



\B\\ 



Proof : 



By the C*-Cauchy-Schwarz inequality, 



\[X,M(B)]\\ = ||M([K,B]) + 2i9íP(fl,y)|| 

< ||[y,B]|| + 2||y|| M ||B||M 

< (S + 2a)\\B\\. 



(3.3) 



In order to dehorrify our formulas, we introduce some notation. First of all, 
we define M n = « + ne for u, £ E, n £ Z/2. Secondly, P Ui „ = P([u n , 
And finally, we define X u = J2n£Z u n+^u,n- This is an approximation of X 
by a step function operator, so that 



\X-X U \\ <e/2 



(3.4) 



This leads us to 



\X x M(B)-M(B)X y \\ = 
= || [X } M(B)] + M(B)(X - X y ) -(X- X X )M(B)\\ 

< (í + 2cr)||B|| + e||M(B)|| 

< (í + e + 2«T)||fl||. 



(3.5) 



Now for any 2 projections P and Q and for any A in sé ', we have ||PAQ|| < 
|P|| ||.4|| IIQH < \\A\\. We use this in the second step below. In the fírst step, 
we use that P u , n Pu,o — SnfiPufi for n e N. And in the third step, we make 
P„ „ = I. We then obtain 



USe of X^ngZ x u,n 



y\\\P([x,x + e])M(B)P([y,y + e})\ 



x,0 



< 



)P x>n M(B)P y , 



y m+i )P x , n M(B)P i 
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With inequality 113. 5|) . we now have what we wanted. 



(3.6) 



q.e.d. 



Even if S = 0, the norm distance between M(_B) and X' does not go to zero as a J, 0. 
Suppose thai X has a continuous spectrum. If a ^ 0, however small, we can always 
choose a; and y in Spec(X) so that \x — y\ < 2a. The above proposition then becomes 
trivial. It allows for large off-diagonal elements as long as they are close to the diagonal. 

This is physically relevant: suppose that the internal energy X of a block of iron 
is measured with an accuracy a of few microjoules. Clearly this measurement does not 
produce decoherence between energy-states inside the àtoms, i.e. eigenstates with energies 
x and y differing several eV. Indeed, the estimates 'kick in' only if the energy difference 
approaches the quality of measurement: \x — y\ ~ er. 



Almost Classical Observables 

Col·lapse with respect to a central observable X is meaningless. This has nothing to do 
with measurement whatsoever: since the spectral projections P(V) of a central observable 
X are central, we see that C*(p)(A) = p P(V5)AP(V5)) = /o(Ei P 0^) = PÍ A )- 
Thus C*(p) = p for all states p. 

This can be generalized for almost classical observables X, i. e. observables for which 
d{X, 2?) is small. 

Proposition 24 Let srf he a von Neumann àlgebra with centre Let A,X e sd ', X 
Hermitean. LetP(V) be the spectral projections of X . Then 

\\P([x,x + e])AP([y,y + e})\\ < " + f d{X f } \\A\\. 

\ x y\ 

Proof : 

Pretty much the same as that of proposition (|23f . Under the assumptions 
above, inequality Í3.4II remains vàlid, as does i't.fill with M(i?) replaced by A. 
Inequality Q3.3J1 is replaced by 

\\[X,A]\\<2d(X,3r)\\A\\ (3.7) 

and {Hm by 

\\X x A-AX y \\ = \\[X,A]+A(X-X y )-(X-X x )A\\ 

< {2d(X, J p ) + e)||A||. (3.8) 

g.e.d. 

The same caveat as before applies: if d(X, 3?) is non-zero, then off-diagonal blocks close 
to the diagonal can remain large, so that no bound for \\p — C*(p)|| is obtained. 
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3.5.4 Generalized State Reduction 



In view of proposition (1211 . it is tempting to speculate that for measurements with 
good quality (a 1), perhaps also ||(M*(p))y — M*(px)\\ <C 1 for all p. Alas, na- 
ture is cruel and hard: consider again the Ef h example on page 23 M (also) measures 



X = ^ ^ q 6 with pointer Y" = I® f J JJ j. One easily calculates a = e(l — e). 

But, taking for p the spin-down state with density matrix ^ 1 / ' ^ Ure 
out (identifying AÍ2 <8 C2 with M2 ® M2) that (M*(p))y is represented by the density 



. / o o\ /o o\ . , / A / o \ „ 

matrix I 1 ) ® l / S ^ px ' V e / V 1 — e /' (j0nsec l uem ' r y 



||(M*(/o))y — M*(px)|| = 1 — e. Thus, by choosing e small, it is possible to have mea- 
surements with <t <§C 1 yet ||(M*(p))y — M*(px)|| ~ 1- In the example above however, 

the ratio varM *i»( y ) equals 1 for all e. And smallness of this ratio does force 

var M*(p)(. r ; 

an approximate reduction, as we will see below. Once again it is not the quality a 'an 
sich' that regulates reduction, but the quality divided by the typical variations in pointer 
outcome, cf. proposition (l23l) . 



Proposition 25 (Generalized reduction) Let X G sé, Y G 3§ be Hermitean. Let 
M : 3§ — > sé be an unbiased measurement of X with pointer Y. Then 

linvr*^ M*ín \\\<o / var M'(p)(^)^var p (A) / / var M »( P )(y)-var P (A) \ 

||(M (p)) Y -M (px)\\<2^ M * (p)(rtY -) l 1 + V WÏpÏïyW) )■ 



Proof: 



Brutally applying the C*-Cauchy-Schwarz inequality would get the job done. 
That is to say it yields \\(M*(p)) Y - M*(p x )\\ < 3\\B\\ J )(yfy) . Partly 

because I don't like the numerator being independent of p (allowing M „^ yty ^ 

to blow up whereas V ^ M m» ) (p)(y ™ ) " ~ ^ s n i ce ly bounded by 1) and partly to 
keep you from dozing off, we'll go about it another way 15 for a change. By the 
GNS-representation (see |K& Rj[n. 278]), we may assume sé to be an àlgebra 
of operators on some Hilbert space Jíf p , with p a vector state ip p . By the 
Stinespring theorem (see jTakl p. 194]), we may assume 3S to be an àlgebra 
of operators on some Hilbert space M, and the existence of a contraction 
V : -> ffl such that M is of the form M(B) = V^BV. Then M*p is a 
vector state with vector Vip p , since M*(p)(B) — (ip p \V^ BV\ip p ). In the proof 
of lemma (|Ï3l) , we have seen that 

var M .( P )Cn - var p (M(F)) = p(F(Y,Y)). 
If we introduce the notation W = V^V), we have 



p(F(Y,Y)) = (ViP p \YÏW 2 Y\V*l> p ) = \\WYViJj p \\ 2 



16 I was put on this track by Màdàlin Gutà, who suggested a simple proof of the C*-Cauchy-Schwarz 
inequality in the case of completely positive maps. 
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The rest is hardly exhilarating: for any B £ 38, 

M*p(Y^BY) - p(M(YÍ)M(B)M(Y)) = 
= ( V ipp | F f BY - Y f V* VBV^ VY \ Vtp p ) 
= (V^ P \Y^W 2 BY + Y^BW 2 Y - Y^W 2 BW 2 Y\V^ P ) 
< 2\\B\\\\YV4> P \\\\WYVil> p \\ + \\B\\\\WYV^ P \\ 2 . 



So, since 



we see that 



M*(p)(Y J< Y) = p(M(Y)^M(Y)) + \\WYVip p 



\\(M*(p))Y(B)-M*(p mY) )(B)\\ = 

M* p(YÏ BY)p(M(Y)ÏM(Y)) - p(M(Y)^M(B)M(Y))M*(p)(Y^Y) 



p(M(y)tM(y))M*(p)(yty) 

p(M(Y)iM(Y)) (M*p(Y^BY) - p(M(y)+M(B)M(y))) 



p(m(y )t M(y)) m* (p) (y ty ) 
p(M(y)tM(B)M(y)) || v^yy^pll 



p(M(y)tM(y )) m* (p) (y ty ) 



< 2IIBI 



21151: 



II^VvIl 



+ 2IIBI 



\\WYVjj p f 
\\YV4> P \\ 2 



/var M . ( P) (y) - var p (M(y))_ var M » (p) (y) 



var 



,(M(y)) 



M*(p)(yty) 



M*(p)(yty) 



q.e.d. 



Since < var M . ( p )(y) < M* (p)(Y^Y), we may also write down a weaker version, starring 
the ratio varM '(p)^ Y ~> discussed above: 

Corollary 25.1 Let X £ gf , Y £ 38 be Hermitean. Let M: 38 — > sí be an unbiased 
measurement of X with pointer Y £ 38 such that var M «( p )(y) ^ 0. Then 



/var M .( P )(y) - var p (X) 




var M « ( P )(y) - var p pf) 



y var M . (p) (y) ^ y var M . (p) (y) 

If M is a measurement with outcome or 1, i.e. Y is a projection, then automatically 
var M *( p )(y) = p(l — p) if p is the probability of measuring outcome 1. From this and 
var M *( p )(y) — var p (X) < cr 2 , we obtain another corollary. 

Corollary 25.2 Let X £ .sé be Hermitean. Let M; 38 — > sé be an unbiased measurement 
of X of quality a which only allows outcomes and 1, i.e. the pointer Y is a projection. 
Then for all states p with probability p of measuring outcome 1: 



||(M*(p)) y -M*(p x )||<2 



1 + 



Vpí 1 - p) \ Vp( 1 -p) 
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3.6 A Paradox Resolved 



Imagine the following thought experiment. The universe is described by the àlgebra 
^ = sé <g> 8S. An observer ^ C 8è contains two separate pointers Y\ and Y<i. (One may 
think of a computer memory consisting of 2 classical bits, for example.) A perfect mea- 
surement M : — > sé ® 8iï is performed on A e ,e/ using Y\ as a pointer. (Information is 
stored in the first bit.) Since all time-evolution is automorphic, M must have as dilation 
some automorphism a of 3>. Then there must still be observables D £ sé ® 2% on which 
no collapse occurs. 

Having learnt the outcome of the first measurement, ^ performs a second perfect 
measurement N : ^ — > sé <g> ^ but now on D, using Y 2 as pointer. (This information is 
stored in the second bit.) Comparing information stored on Y\ with that on Y 2 , ^ has 
solved the riddle of reduction once and for all: 

- Either there is a full and objective reduction after the first measurement, and the 
state of S jumps into an eigenstate of X. 

- Or all time evolution is automorphic, and purity on £> is conserved 

The difference cannot be seen on observables commuting with Yï, but it can be seen on D. 
(Un?)fortunately, such a crucial experiment is not possible: apply the next proposition 
toaoN. 

Proposition 26 Suppose Y\, Y% £ 8è are Hermitean elements such that [Yï, Y 2 ] = 0. Sup- 
pose M: — > sé <g> 8$ is a measurement of D £ sé <g> 8$ with pointer Y2 such that 
M(Yi) =I®Yi. Then 

[D,1<8>Yí] =0. 

Proof : 

02 = ||Y 2 || M =0, soO = M([Y 1 ,Y 2 ]) = [M(Y 1 ),M(Y 2 )} = [I®Y U D] 

q.e.d. 

It is true that a measurement can be performed on D £ (I ® Yl)'. But this necessarily 
erases the information that was gained on X from the pointer Yl. 
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Epilogue 



The subject of quantum measurement is particularly susceptible to misunderstanding. 
I would therefore like to clarify (perhaps superfluously) my view on the so-called 'mea- 
surement problem' in relation to the interpretation of quantum mechanics ventilated on 
page0 The problem of measurement is commonly defined as follows: 

How and when do observables take one particular value out of all the possibil- 
ities allowed by quantum mechanics? 

On pageElas well as on page^Hl I have briefiy sketched some of the problems one would 
have to overcome when answering this question. I do not make any attempt to do so. In 
my mind, a more relevant question seems to be: 

How and when is one particular value of an observable observed by one par- 
ticular observer, out of all the possibilities allowed by quantum mechanics? 

In order to consider this question, one needs a theory with mathematical represen- 
tatives of both the primitive notion of 'observable' and of 'observer'. Observables are 
commonly modelled by Hermitean elements. This seems rather sensible to me. But how 
to model an observer? 

In my mind, the key property of any 'observer' is that it is able to directly observe a 
number of observables. I therefore represent an abstract 'observer' by the set ^ C 9 of all 
observables which it can detect directly. Since an observer can construct sums, products 
and limits from the observed vàlues of observables in fé*, it seems plausible that ^ is a 
C*-algebra. And since simultaneous observation of observables in c € necessarily induces a 
map of the form discussed in the example following proposition lllóll , c é> may only contain 
commuting observables. ^€ is an Abelian C*-algebra. 

One would like to assign a value to each D £ £>. This cannot be done in a consistent 16 
manner. But with the interpretation on page consistency is only necessary within 
Abelian algebras. It suffices to have a random generator do the following: 

- At time 0, choose an Abelian ^ C 2. 

- Assign vàlues to all Hermitean C S c € in a consistent manner, according to the joint 
probability measures induced by p. These are the vàlues observed by at time 0. 

18 At least if °J = SS{3ff) with dim(J€ > 2). See |KfcS] . 
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- Repeat this for all possible abelian fé 7 C @. Quantum mechanics does not prescribe 
joint probability distributions for non-commuting observables, so we have some free- 
dom in our choice of random generator. The procedure need not be independent for 
different observers: there may well be some consistency in their observations. But 
according to fkX S . full consistency is impossible in general. 

Now the question 'What value of C £ c € is observed by ^ at time í?' is answered 
as follows. Look at the Abelian àlgebra at(fé') at time 0. The Hermitean observable 
at(C) gets assigned a value in Spec(a t (C)) = Spec(C) with probability distribution 
IPp.a t (C) — Pa*(p),C" This is the value of C observed by ^€ at time t. 

This is a deterministic procedure: at time 0, the random generator determines what 
each observer gets to observe at time t. But observations are not objective. If A € H 
and A £ ff, it may well be that A gets different vàlues with ^ and c €: different observers 
observe different vàlues of the same observable at the same time. From this, you see 
that the same observer ^ may also observe different vàlues of the same C at different 
times. The trick is to show that these observations are always consistent with all other 
observations of the same observer at the same time, and that they allow an observer 
to store information about the world around it. This can be done entirely within the 
framework of (quantum) probability theory, without any reference to the nature of the 
random generator, c.f. page^l propositionEl proposition[2]and nagelfiOl 

This is my way to intèrpret these propositions. But I would like to emphasize for one 
last time that the interpretation above is merely a tool. Any consistent interpretation 
is just as good as any other. The structure of nature is engraved in mathematics, and 
interpretations only serve to tie abstract structure to daily experience. This is why I have 
chosen to be brief in explanation and tedious in calculation. In particular, it explains why 
the above exposition is mufHed away in this epilogue. 
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